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Abstract 

A new test is proposed for the weak white noise nuh hypothesis. The 
test is based on a new automatic choice of the order for a Box-Pierce or 
Hong test statistic. The test uses Lobato (2001) or Kuan and Lee (2006) 
HAC critical values. The data-driven order choice is tailored to detect 
a new class of alternatives with autocorrelation coefficients which can 
be o(n~^/^) provided there are enough of them. A simulation experi- 
ment illustrates the good behavior of the test both under the weak white 
noise null and the alternative. JEL Classification: Primary C12; Sec- 
ondary C32. Keywords: Weak White Noise Hypothesis; HAC Inference; 
Automatic nonpar ametric tests; Adaptive rate-optimality. 



1 

1. Introduction 

Testing for white noise is important in many econometric contexts. Ignoring autocorrelation 
of the residuals in a linear regression model can lead to erroneous confidence intervals or tests. 
Correlation of residuals from an ARMA model or of the squared residuals from an ARCH 
model can indicate an improper choice of the order. Investigating autocorrelation function 
is also a popular diagnostic tool in Macroeconomics and Finance, see e.g. Durlauf (1991) 
and Campbell, Lo and Craig MacKinlay (1997). 

The earliest tests for white noise were based on confidence intervals for autocorrelation 
coefficients as described in Brockwell and Davies (2006) or Fan and Yao (2005). See also 
Xiao and Wu (2011) who recently derives the asymptotic distribution of the maximum 
standardized sample covariance of weak white noise processes. A second approach was 
estabhshed by Grenander and Rosenblatt (1952) who extended goodness-of-fit tests such as 
Kolmogorov and Cramer- von Mises tests to white noise testing. See also Durlauf (1991), 
Anderson (1993) and Deo (2000). Following the popular Lagrange Multiplier approach, 
Delgado, Hidalgo and Velasco (2005) proposes a modified test statistic to be used with 
estimated residuals. Shao (2011a) has recently extended this setup to cover the weak white 
noise null hypothesis. 

An appealing feature of Cramer- von Mises type tests is detection Pitman local directional 
alternatives converging to the null with the parametric rate n~^/^, where n is the sample 
size. This contrasts with detection results for Box-Pierce type tests as in Hong (1996) 
or Paparodotis (2000) who both consider slower rates of convergence for local alternatives 
defined through the spectral density function. Such a finding suggests that Cramer-von 
Mises tests are more powerful than Box-Pierce ones. One of the contributions of the present 
paper is to deliver an opposite conclusion for a new class of alternatives defined through 
the autocovariance function. The new class of alternatives formalizes the idea that small 
autocorrelation coefficients, say of order p„, can be detected provided that there are enough 
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of them regrouped at reasonably small lags. An important finding of the paper is that 
detection is still possible for very small Pn — o (n~^/^) . The intuition is as follows. As 
seen from Hong (1996), Shao (2011b) and Xiao and Wu (2011), the critical region of the 
Box-Pierce test of order p„ — )■ oo is 

where is a normal critical value. Consider an alternative close enough to the null of 
independence, so that = n}!'^ (jij/Ro — Rj/Roj can be considered as approximately in- 
dependent standard normal, where Rj and Rj denote respectively sample and population 
covariance at lag j. See e.g. Brockwell and Davies (2006, Theorem 11.2.2) for a justification 
for such a rough and intuitive setup. Then taking this approximation as exact gives, since 

{2p^f' {2p^f' {2p^f' {2p^f' 

nyy," RVRl 

This expansion shows that the Box-Pierce test is consistent provided [nj (2p„)^''^j Y^jL\R^jl 
is large enough or diverges. Let be the number of correlation coefficients Rj/Rq > for 
j e [l,p„], so that (n/ (2p„)^/^^ Yl%i Rj/Ro > nNnPl/ {2pnf''^- Hence the Box-Pierce test 
is consistent provided 

a condition which allows for Pn = o (n~^/^) provided there are enough correlation coefficients 

1 /2 

larger than p„, that is N^jpn — >■ oo which holds in particular when the exact order of 
is Pn- In plain words, summing sample correlations as in the Box-Pierce statistic allows 
to detect very small population correlations provided they are not too sparse and concen- 
trated at lags smaller than p„. Such a detection feature is lost by the Cramer- von Mises 
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type tests which weights down high order correlation coefficients or by the Xiao and Wu 
(2011) maximum test. As detailed in Section |4| such alternatives includes MA processes 
with a significant long term multiplier but o (n^^/^) impulse response coefficients. Such pro- 
cesses therefore correspond to a macroeconomic scenario where short term policies have no 
significant effects whereas long term ones may have an impact. 

An important limitation of the critical region ( |1.1 ) is the use of an ad hoc order P„. Hong 
(1996), Shao (2011b) and Xiao and Wu (2011) consider a deterministic p„ — > oo. This 
is inadequate to detect alternatives with low lags correlation: taking p„ = 30 is unlikely 
to give a test with power against popular AR{1) or MA{1) alternatives with a reasonable 
sample size. Conversely, taking a fixed Pn as in the original Box and Pierce (1970) paper 
is not suitable to detect higher order alternatives. The need to properly address the tuning 
of a smoothing parameter which plays a role similar to Pn has spurred the development 
of data-driven approaches for various nonparametric testing problems. A recent approach, 
the so-called adaptive approach, focuses on data-driven tests which detects alternatives in a 
smoothness class converging to the null at the fastest possible rate given that the smoothness 
class is unknown to the test builder. See in particular Fan (1996), Spokoiny (1996), Horowitz 
and Spokoiny (2001), Guerre and Lavergne (2005), Guay and Guerre (2006) and Chen and 
Gao (2007) for various nonparametric models and related null hypotheses of theoretical or 
practical relevance. Golubev, Nussbaum and Zhou (2010) has proved Le Cam equivalence of 
Gaussian time series with spectral density functions in a Besov space and the corresponding 
continuous time Gaussian white noise model considered in Spokoiny (1996). This result, 
limited to Gaussian Time Series, is of theoretical nature and cannot deliver ready to apply 
white noise tests, especially for the weakly correlated alternatives in (??). In fact, most of 
the data-driven choices of pn proposed in the white noise testing literature do not consider 
the adaptive rate-optimality issue. An exception is Fan and Yao (2005) which outlines, but 
do not analyze, a data-driven test which is based on the maximum of a set of Box-Pierce 
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statistics. Popular data-driven choice of the order build on Newey and West (1994), see, 
among others, the simulation section of Hong and Lee (2005). This practice is however 
difficult to justify theoretically since such a data-driven order is expected at best to be 
optimal for estimation of a long run variance, which is the purpose of Newey and West (1994). 
As it is well known, this will not produce an adaptive rate-optimal test since the optimal 
order for testing differs from the estimation one, see Ingster (1993) and Guerre and Lavergne 
(2002). Escanciano and Lobato (2009) proposes a data-driven order choice of the order based 
on an AIC/BIC criterion which is suitable for estimation but not adaptive rate-optimal for 
white-noise testing. This contrasts with the new test proposed here which is adaptive rate- 
optimal with respect to a class of alternatives allowing for correlation coefficients of order 



as in ^IM which, as far as we know, has not been previously considered. 
A third issue addressed in the paper concerns the behavior of the data-driven test under 
the weak white noise hypothesis. For directly observed variables, Escanciano and Lobato 
(2009) considers a more restrictive martingale difference null hypothesis and, as far as we 
know, the null limit distribution of test statistics using a data-driven order as in Newey 
and West (1994) has not been studied yet. Shao (2011b) considers the case of directly 
observed or estimated residuals. He finds that the standardized Box-Pierce statistic has 
a standard normal limit distribution under the weak white noise hypothesis, but provided 
p„ — 7- oo so that the resulting test would have low power against low order AR or MA 
alternatives. When is fixed, available choice of critical values involves the block bootstrap 
as in Romano and Thombs (1996) and Lobato, Horowitz, Nankervis and Savin (2006) or a 
matrix standardization of the sample covariance which considerably modifies the Box Pierce 
statistic as in Lobato, Nankervis and Savin (2002), Francq, Roy and Zakoian (2005) or 
Delgado and Velasco (2010). These critical values involve a block length or a bandwidth for 
which there is no obvious choice. We design instead a data-driven choice p of the order used 
in the test which is asymptotically equal to 1 under the weak white noise hypothesis, for 
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directly observed or estimated residuals. It then suffices to use critical values for p = 1, that 
is for the simple statistic nRf. The robust critical values of Lobato (2001) can be used when 
the residuals are directly observed. For estimated residuals, Kuan and Lee (2006) ones can 
be used. 

The paper is organized as follows. Section |2] details the penalty approach leading to the 
data-driven order p and the construction of the rejection region of the test. Section |3] studies 
the test under the general weak white noise null hypothesis and under the new class of 
alternatives mentioned above. It illustrates the importance of the choice of a suitable penalty 
both under the null and the alternative. Section |4] states our adaptive rate-optimality results 
and compares the new test with the Cramer- von Mises test in Deo (2000), the data-driven 
test of Escanciano and Lobato (2009) and the Xiao and Wu (2011) maximum test. Sectionjs] 
is a simulation experiment which proposes a calibration of the penalty term and a comparison 
of our automatic test with other data-driven ones, including Escanciano and Lobato (2009) 
and Newey and West (1994) plug in choice of the order. Section [6] concludes the paper. Our 
main assumptions are gathered and discussed in an Appendix, while proofs are grouped in 
a supplementary material document. 

2. Construction of the test and choice of the critical values 

Consider a parametric model m{Xt,Xt_i, , . . . , Zt;6) = Ut and observations X^, Zt-, t = 
1, ...,n. The scalar error term Ut has zero mean and finite variance and is unobservable when 
6 is unknown. In simpler situations Ut can be directly observed as for financial returns. We 
are interested in testing that ut is uncorrelated. Let 6 be an estimator of 6 and estimate the 
population residual Ut with its sample counterpart Ut = Ut{9). Suppose {ut] is a stationary 
process with zero mean and covariance function Rj = Cov{ut,Ut+j). Then the null and 
alternative hypotheses are 

Tio : Rj = for all j ^ 0, versus Tii : Rj for some j ^ 0. 
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A natural estimator of the covariance is Rj = Y^t=i^ ^tUt+\j\/n, j = 0, ±l,...,±(n — 1) 
which uses the estimated residuals as if they were the true ones. Given the kernel spectral 
density estimator 

= ^ ^ (-) exp i-ijX) , K{0)^1, K (x) = K i-x) , and fx (x) dx = 

^ j=—oo \P / 

where i — and the support of i^(-) is [0,1], Hong (1996) has proposed a Lagrange 

multiplier type test statistic 

% = riTi UX;p) - ^ dX = nJ2K' Q (2.1) 

For the uniform kernel K{t) — I{t e [0, 1]), Sp is the Box-Pierce statistic BPp — J^X^^^i R]- 
Large values of Sp indicate evidence against the null. The proposed test builds on a data- 
driven choice of the order p in an set [l,p„], p„ < — 1- Under standard weak dependence 
of {ut} and for p large enough, the mean and variance of (^Sp — Si^ / Rq are asymptotically 
close to, respectively 



Vi(p) = 2|;(l4)'(A- (^)-A-(i)) 



see e.g. Hong (1996) for independent {tit} and Shao (2011b) for the weak white noise case. In 
these notations, the subscript "A" indicates difference Sp — Si. For the Box-Pierce statistic, 

E^{p) and V^{p) are respectively equal to (p — 1) (1 + O (p/n)) and 2 (p — 1) (1 + O {p/n)). 
We propose to select p as the smallest maximizer of a penalized statistic, 

p = arg niax I ^ - E (p) - 7nV^A(p) 
pe[i,p„] yi?2 

= arg max ( ^^^^ - Ea{p) - 7„Va{p) ] , (2.2) 



where E{p) = Yl^=i ~ 3 1'^) {j /p)- Such a penahzation procedure is similar to Guay 
and Guerre (2006) or Guerre and Lavergne (2005). It differs from the ones used in the 
suboptimal AIC or BIG procedures reviewed in Hart (1997) which uses a higher penalty 
term jnE (p) in place of E (p) + 7nVA(p). Escanciano and Lobato (2009) similarly uses a 



penalty term ^jnE (p) for p in a finite set. The rationale for (2.2) is better understood when 
Sp is the Box-Pierce statistic BPp with directly observed or estimated residuals. In this 
case {^BPp — BP^ I Ea{p) is an estimator of n X]^=2 ^]/^o with a standard deviation 
which can be proxied with Va{p) when p — )■ oo, see Shao (2011b). Hence a large penalized 
statistic (^Sp — Si^ / ~ ^Aip) — 7nVA(p) suggests that ^ X]j=2 -^i /-^o large so that 
this particular order p should be preferred to p = 1. The selected p will retain the best 
order p with respect to this detection criterion. In particular, under the null, the Markov 
inequality yields that (^Sp — 5*1 j /Rq — E^ip) — JnVAip) = —In (1 + op (1)) Va(p), uniformly 
with respect to p G [l,p„] provided 7„ diverges fast enough. Since Va{p) > Va(1) = for 
all p > 1 and 7„ — > oo, 1 = argmaxpg[i_p^] (— 7ri,VA(p)) so that p = 1 should hold with a 
probability tending to 1 under "Ho- It follows that Sp = Si + o^ (1) under the null, for directly 
observed or estimated residuals. This leads to the following rejection region of the test 

Sp>zia), (2.3) 



where the critical value z{a) = 2~„(q;) satisfies 



lim P f^i > z{a)] = a under Ho- (2.4) 

n— >oo V / 



The critical values used here for directly observed residuals are from Lobato (2001) and from 
Kuan and Lee (2006) for estimated residuals. 
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A choice for critical values when {ut} is directly observed is as follows. Let a tilde super- 
script indicate more explicitely this specific case, 

"-lil 



Sp = nY,K^ r-j R^j where Rj = ^tUt+\j\. 



(2.5) 



As seen from Francq et al. (2005) or Lobato et al. (2002), the limit distribution of Si = 
nK^ (1) Rl depends upon the limit distribution n^/^ — Ri^ , which is under standard 
conditions a centered normal with variance 



lim nVar ( — 'S^ UtUt+i ) = Var (ulul) + 2 'S^K [{uqUi — Ri) {ukUk+i — Ri)] = Ti. 
\ t=\ / k=\ 

Hence, under "Ho, the limit distribution of SxjK'^iX) is the one of a chi square with one 

degree of freedom times the nuisance parameter Fi. The HAC approach developed by Kiefer, 

Vogelsang and Bunzel (2000), Sun, Phillips and Jin (2008) and pioneered by Lobato (2001) 

for weak white noise testing delivers a pivotal test statistic with a null limit distribution 

which does not depend upon Fi. Under suitable conditions, Lobato (2001) shows that 



nR\ 



^ n— 1 / * / ]^ n—\ 

where Fi = , 5Z 5Z ^^^^+1 " H 



converges in distribution to 



'3 

j=i \ " i=i 



1^2(1) 



-0 — (2-6) 

{W [r) - rW {\)f dr 

where is a standard Brownian motion, a limit distribution which is free from the nuisance 
parameter Fi. Hence following Lobato (2001) suggests the critical values 

Ii(a) =7^^2(1) fizi. (a), (2.7) 

where zi^ [a) are the critical values obtained from the distribution of the random variable 



(2.6) which are tabulated in Lobato (2001, Table 1). Alternative approaches would use a 
Newey and West (1994) estimator of Fi as in Francq et al. (2005), Lobato et al. (2002) or 
fixed bandwidth asymptotic as in Sun et al. (2008). Note however that these alternative 
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procedures involve an additional choice of a tuning parameter and may be more involved 



than (2.7) 



The HAC procedure of Lobato (2001) has been extended by Kuan and Lee (2006) to deal 
with the case of estimated residuals. Let 6t be the estimator 9 computed with the first t 
observations and consider the recursive estimator of Fi 

p.-(;^|:(E(%(?.)=-'(«')-;r^«0" 

Kuan and Lee (2006) shows that, under suitable conditions, nRf/Ti and nR\/Ti have the 
same limit distribution. Hence this suggests the critical values 



ZL{a) = K\l)ViZL{a) , (a) as in (2.7). (2. 



We shall also consider a modified version of the test which uses a standardization of the 
sample covariances as in Deo (2000) or Escanciano and Lobato (2009), 

5; - E '^'^ (^) (I) ' ^-^^ - {^^^) ^ ■ <^'" 

The sample variance is an estimator of = Var {ufUt-j) which is the asymptotic variance 
of n^/^ (^Rj — Rj^ in the case of uncorrelated UtUt+j or for Martingale differencej^ As above, 
Ut should be used in place of Ut when the residuals are directly observed, leading to statistics 
5** and r| with a tilde subscript instead of a hat one. The corresponding data-driven p and 
critical values are 

p* = arg max (s; - E (p) - 7n^A(p)) , (2.10) 
'^1 '^l 



%ote : 



however that differs from Var (^\/n (^Rj — Rj^^ which would be the appropriate standardization. 
But in fact all these quantities goes to i?§ when j — >■ oo, which is the reason why they can be used here. 
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3. Asymptotic level and consistency 

This section deals with the behavior of the test under the null and the alternative hypothe- 
ses. For the sake of exposition and brevity the associated assumptions are grouped and 
discussed in Appendix A. In what follows, a„ x 6„ means that the sequences {a„} and {&„} 
has the same order, i.e. that there is a constant C > 1 such that |a„| /C < |6„,| < C |a„| for 
n large enough. 



An important issue in the construction of the test (2.3 ) is the choice of the penalty sequence. 
Choosing 7„ large enough gives that p stays close to 1, hence that the test statistic Sp remains 
close to ^i. Hence, in one hand, using a large 7„ ensures that the level of the test is close to 



its nominal size due to the choice (2.4) of critical values which is driven by the asymptotic 
distribution of 5*1. On the other hand, a large 7„ may drastically limit the power of the test 
since the statistic Sp would not differ from Si, limiting so the power of the test. The trade-off 
between size and power concerns is addressed by the two first results of this section. Consider 
first the null hypothesis "Ho- The following theorem gives a lower bound for 7„ which ensures 
that the test is asymptotically of level a under the null. 

Theorem 1. Let Assumptions K , and[^ in Appendix A hold. If the penalty sequence 

{in, n > 1} satisfies 

1 /2 

7n > (1 + e) (2 Inlnn) ' for some e > 0, (3.1) 



then, under Ho, lim^^oo P (p = 1) = 1 and the test (2.5) is asymptotically of level a. 



The main result of Theorem [T] is about the asymptotic behavior of the selected order p, which 
is asymptotically equal to 1. It then follows that Sp = Si + op (1) so that the choice of the 



critical values (2.7) or (2.8), which accounts for estimation of the residuals and white noise 



dependence, ensures that the test is asymptotically of level a. Note that Sp = Si + op (1) 



allows to use other critical values than (2.7) or (2.8), as standard one degree of freedom 



Chi squared ones which are valid under a stronger null of independence. A key result is 
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therefore that hm„^oo P = 1) = 1 holds when the residuals are estimated or not and under 
various white noise structure. That this holds for estimated or directly observed residuals 



comes from (3.1) which imposes 7„ — )■ oo. When 6 is A/n-consistent as assumed here and 
under the considered assumptions, estimating the residuals gives test statistics satisfying 
Sp = Sp + Op (1) uniformly in p. The fact that the remainder term Op (1) is negligible 
compared to 7^ is a key element to show that the asymptotic behavior of p is not affected by 
residuals estimation under the null. Compared to the existing adaptive results of Horowitz 
and Spokoiny (2001), Guerre and Lavergne (2005), Guay and Guerre (2006) or Chen and 
Gao (2007), an important technical contribution is that Theorem [l] holds without assuming 
that the set of admissible p is a power set as {a^ ,j G N}, a > 1. 
Another important finding is that the penalty sequence 7^ can diverge with the low order 



(Inlnra)^^^ as allowed by (3.1). This contrasts with the larger order Inn used in the BIC 
selection procedure and in the corresponding data-driven tests, see e.g. Hart (1997). In view 
of the potential negative impact of a large 7„ on the power of the test, it is worth asking if 



the lower bound (3.1) can be improved. The proof suggests that it is not the case. The key 



argument comes from the expression 



P (p ^ 1) = P ((Sp - 5i) /Rl - E^{p) - 7nVA(p) > for some p E [2,pJ 

/ [(Sp-S,)/Rl-EM\ \ 
= P max ^ f— > 7n • (3.2) 



for the probability of not selecting 1 when the residuals are directly observed. In the case 
of the Box-Pierce statistic Sp — Si = nY^^=2^'j- The proof then uses a martingale approx- 
imation for Rj as in Xiao and Wu (2011), Shao (2011b), a smooth approximation of the 
maximum by the Lg norm, e — )■ 00 and repeated applications of the Lindeberg technique, 

^For positive Xi, . . .,Xm, {YlT=i ^it^" = (l + O (e^^ Inm)) maxfcg[i^„] Xk- 
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see Pollard (2002, p. 179), to approximate the LHS of (3.2) with 



P I max 

P6[2,p„ 



where the (j are i.i.d. standard normal. Hence the best order ensuring that F {p^ 1) = o(l) is 
the order of the maximum of the standardized sum (2A;)~^/^ Sj=i {Cj ^ ^) ^ k = 1, . . . ,p^ — l, 
which is (21nln (p„ — 1))^^^ x (21nlnri)^^^ as shown by Darling and Erdos (1956). The term 
(1 + e) is used to control for the fact that the variance of Rj/Vai {ut) are close but not equal 



to 1 due to possible dependence of the uncorrelated {ut}. Hence the bound (3.1) cannot be 
improved. 

Let us now turn to the detection properties of the test. Consider first the case of directly 
observed residuals {ut}. In our setup, the correlated alternative {ut} may depend on the 
sample size and the observations should be denoted Ut^n, t = l,...,n with a covariance 
function Rj = Rj^n, where for each n {ut^n} is stationary. This includes for instance local 
MA (oo) alternatives £t + YliLi '^i,n^t-i where ai^n when n grows. For estimated residuals 
Ut = Ut (^9^ , we assume that y/n (^9 — 6*^^ is asymptotically centered for some pseudo true 
value On and we set Ut (On) = Ut,n since this residual plays an identical role than the alternative 
{ut,n} of the directly observed case. However, for the sake of brevity, we write in both cases 
Ut and Rj instead of Ut^n and Rj^n- 



The new class of alternatives is defined similarly to (1.3) in the Introduction section. Con- 
sider first a sequence p„ — )■ and a lag order Autocorrelation coefficients smaller than 
Pn are considered as negligible and an important parameter for detection is the number of 
correlations above p„, 

iV„ = iV„ (P„,p„) = # {\Rj/Ro\ >Pn, 1 < J < Pn} . (3.3) 



The next theorem gives a detection condition on Nn, Pn and p„ which is similar to (1.3). 



Note however that (1.3) was derived assuming a known P„, a condition which is now relaxed. 
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Theorem 2. Suppose Assumptions K , M, ^and^in Appendix A hold. Then, there exists 



a constant k^, > such that the test (2.3) is consistent against all alternatives {ut} satisfying, 
for some Pn > and Pn G [l,p„/2], 

1/2 

Pn > (3.4) 



n 



1/2 



Nn 



1/2 



The most noticeable difference between the detection conditions (3.4) and (1.3) is that the 



lag index p„ in (1.3) is used in associated critical region (1.1) whereas the lag index P„ in 



(3.4) is unknown and can take any value in [l,p„/2]. This illustrates the adaptive feature of 
the new test. A second difference is that ( 3.4[ ) involves the penalty sequence. In fact (3.4) 
deteriorates with the penalty sequence since increasing 7„ request to increase p„ or Nn to 
ensure that the condition still holds. This illustrates the potential negative impact on the 
power of the test of the penalty sequence. 
For alternatives such that P„ and Nn are prescribed in advance, the detection condition 



(3.4) allows for a rate p* satisfying 



1/2- 
n 



1/2 



n 



1/2 



Nn 



(3.5) 



Two different regimes emerge in view of (3.5). Of special interest is limn^oo InPn /Nn = 



since (3.5) shows that the test can detect correlation coefficients converging to at a rate 
that is faster than the parametric rate n"^/^. The best possible rate in this case is p* x 



1/2 



which is achieved for "saturated" alternatives with A^^ x P„. A less favor- 

1/2 



able case corresponds to more sparse correlation coefficient satisfying lim„_j.oo InPn /Nn 



oo. In this case (3.5) does not anymore allow for correlation coefficients converging to with 
rate This case is covered Donoho and Jin (2004) and Ingster (1997) for a theoreti- 

cal model obtained when observing a known number P„ of independent Gaussian variables 
with mean n^Rj/Rof' and variance 1. In such a setup, the authors show that the best 

1 /2 

possible detection rate p„ is (In n/n) ' , a rate which is achieved by the maximum white 
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noise test of Xiao and Wu (2011). This suggests that our test may not be optimal when 
hm„_!.oo 7n-Pn^^/^n = oo. Howevei, it will be shown in Proposition hJ below that the Xiao 



and Wu (2011) test does not detect moderately sparse alternatives satisfying (3.5) with 
Yimn^oo InPn /Nn = and 7„ X (21nlnra)^/^. 



We conclude this section by showing that the test statistic Si, from (2.9) and (2.10) has a 



similar behavior than S^. 



Theorem 3. Suppose Assumptions 



K 



M andj^ in Appendix A hold. Then using (^S^ , (a) 



or iS^,zl{a)] in (2.3) instead of (S'p, ^^(a)] or iSp,'zL (a)) gives a test which satisfies 



the conclusions of Theorems [I] and \^ 



4. Adaptive rate-optimality and comparisons with other tests 



While Theorem 111 gives a sharp lower bound (3.1) of order (2 In In n)^^^ for the penalty 



sequence 7„ ensuring that the test is asymptotically of level a, Theorem [2] suggests that 
increasing 7„ can damage its detection properties. Hence a good compromise for the choice 

1 /2 

of the penalty sequence suitable both under "Ho and "Hi is 7„ x (2 In Inn) . Once such 
a choice is made one may wonder if the detection properties of the resulting test can be 
improved or not. Focusing on alternatives with a prescribed -P„ and A^„, this amounts to 
show that there is no test detecting alternatives with supj^^i p^^ |-Rj/-Ro| = o (p*) where p* is 



as in (3.5), a property that we call adaptive rate-optimality. More generally this amounts to 



check that there is no test that can detect alternatives satisfying a condition less restrictive 



than (3.4), i.e. allowing for a = k„ — )■ in (3.4) leads to consider alternatives that 



cannot be detected by any possible tests. The next Theorem establishes such adaptive 
rate-optimahty for alternatives satisfying lim^^oo 7n Pn^'/Nn = Ofl 



■'As discussed after (3.5 1, the test (2.3 1 is not optimal for detection of sparse alternatives with 



1 /2 

lim„_>.oo ^nPn /Nn — oo which are not considered here. 
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Theorem 4. Consider the case where {ut\ is directly observed. For any k„ — )■ 0, there exists 
a sequence of alternatives {uf} such that, for some Pn G and a p„ > with 

Pn > ^ , lim = 0, 

and satisfying the other assumptions of Theorem that cannot he detected by any possible 
asymptotically a-level test, a G (0, 1). 



Hence, when 7„ x (2 Inlnn)^''^, it is not possible to improve the detection condition (3.4) 



and the rate p* in ( 3.5[ ) is optimal. We shall now give alternatives which are detected by the 



test (2.3) but not by other popular tests. Consider the following high-order moving average 



process, 

1/2 P„ 



Mt = Mt,„ = etH " 1M y^^fcgt-fc, y^yl = 0{Pn), lim P„ = oo, (4.1) 



„l/2p^ 



where {et} is a strong white noise with variance cx^, z/ is a scaling constant and jn ^ 
(2 In In n)^''^. This alternative has MA coefficients of order 7^^/ (^n^^'^Pn^'^^ which goes to 
faster than n~^/^ provided Pn diverges with a polynomial rate. Hence short term shocks have 



a statistically negligible impact. However the long term multiplier of (4.1) is, when ip^ = 1 
for all k, equal to u i^nPn'^ /n^ which has a larger order than rT^I'^ . The following lemma 



describes the covariance function of the alternative (4.1). 



Lemma 1. If Pn = o((n/7„)^/^) and lim„_>oo (in/n) = 0, then the alternative {ut} in (4-1) 
satisfies Rq = (^1 + O (^jnPn^'^ / and, uniformly in j G [1, Pn], 

1/2 / 1/2 \ 



Hence a distinctive feature of the alternative (4.1 ) when maxi<fc<p,^ \^pk\ = O (1) is that both 



its moving average and correlation coefficients approach zero uniformly faster than n 
provided Pn/ln tends to infinity. We shall show below that the new test (2.3) detects these 
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alternatives but that this not the case of the three following test statistics based on directly 



observed variables and with rj as in (2.9), 



Wr, =br, \ n^/^ max 
i6[i,J„] 



Rj 



- br, 



hn = (2 In Jn - In In J„ - In (47r)) 



1/2 



J=l •' ^ 



(4.2) 



(4.3) 



ELr, = BP~* , p'el = arg max <^ BP^ - -f^^^p \ where 



7el 



pe[i,Jn 

Inn if n^/^maxjg[i,j„] 
2 otherwise. 



R 



< (2.4 Inn 



1/2 



(4.4) 



The statistic (4.2) is studied in Xiao and Wu (2011) who shows that Wn has asymptotically 



an extreme value distribution. The statistic (4.3) is due to Deo (2000) and is a version of the 
Cramer-von Mises test of Durlauf (1991) partially corrected for heteroskedasticity. The test 
statistic ELn has been introduced in Escanciano and Lobato (2009) who considers a fixed 
Jn- We show that it can also work when J„ increases with the sample size. The numerical 
value 2.4 used for is used in the simulation experiment of Escanciano and Lobato (2009) 
but the proof of Proposition [l] below suggests that any real number strictly larger than 2 
would also work. As for our test, the Escanciano and Lobato (2009) selected order p*^j^ is 
asymptotically equal to 1 under T-Lq and similar critical values can be used. To show that 
these tests do not detect alternatives with small correlation coefficients, it is sufficient to 
consider the single Gaussian null hypothesis Gq: {ut} is a Gaussian white noise {et} with 



variance and the Gaussian alternative Gi: {ut} is given by (4.1) with Gaussian i.i.d. {st}, 
Y.llii'l = 0{Pn), maxi<fc<p„ \ipk\ = O (1) , mini<fc<p„ iV^fecr^l > 1, z/ > 0, 7„,P„ 00 with 
7„/p„i/2 = o{l/\nn) and P„ = O ((ri/7„)^^^^) < p„/2, 7„ x (2 In In n)^/^ satisfies (3.1). We 
also assume Jn = O (n^/^) . 
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Proposition 1. Let {ut} be directly observed. Suppose that Assumptions and [P| in Ap- 
pendix A holds. Then, for v large enough, the alternative G\ satisfies (3.4^ and 



(i) The new test (2.3) and its version consistently detect Gi 



(a) By contrast, the statistics Wn, CvMn and ELn have the same asymptotic distribution 
under Gq and Gi and the corresponding tests are therefore not consistent. 

Proposition [l]-(ii) implies tliat tests based on Wn, GvMn or ELn are not adaptive rate- 
optimal. This is due to a continuous behavior of these test statistics that prevents detection 
of Gi as explained now. Let Rqj/tqj and Rij/ti^j be the standardized sample covariance 
computed under Go and Gi respectively. It is established in the proof of Proposition [T] that 



max 

je[i,J„] 



Ro,j 


Ri,j 


= ''^[ ( 1 a/2 ) 






y(nlognj ' J 



(4.5) 



a fact which implies that he tests Wn and CvMn are not consistent. The case of the ELn test 



is a bit more tricky. It is first shown that (4.5) yields 7|;^ = Inn with a probability tending 
to 1 under both Go and Gi because the statistic Wn has the same asymptotic behavior 
under Go and Gi. The next step is to show that P {pel = 1) — ?• 1 under Go and Gi. This 
holds by construction under Gq. To understand that this also holds under Gi, observe that 



Pel = P > 1 implies in particular that BP^ + YelP — BPi + Yel by (4.4), an inequality 
which is equivalent to 

' ' >7k = lnn + op(l). (4.6) 



p — I 

/ H< *\ 

But, due to the division by j9 — 1, the asymptotic behavior of ( BP^ — BP^ j / (p — 1) is the 
same under Go and G\ by (4.5), so that P {p*el = 1) — ^ 1 under Gi. Since ( 4.5[ ) also gives 
that the limit distribution of BP^ is the same under Go and Gi, this would also be the case 
of the test statistic BP~,^^ = BP^ + op (1), so that the Escanciano and Lobato (2009) test 



ELn is inconsistent against Gi. As seen from (4.6), this is due to a too high penalization 
term JelP^ proportional to p, for BPp. This contrasts with our test which applies a penalty 
of lower order •ynP^^^ to the "debiased" test statistics BP — E {p). 
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5. Simulation experiments 

This simulation experiment aims to propose a reasonable value of the penalty sequence 7„ 
to be tested with various strong and weak white noise processes and with various alternatives. 
As preliminary experiments have shown that the test statistic Sp may yield an oversized test 
for some practically relevant white noise processes, we consider the test based on 5*^, as in 



k{t) 



(2.9) and (2.10) with the critical values (2.8), z* (a) for directly observed variables and z* (a) 
for estimated residuals, a = 10%, 5% and 1%. To investigate the impact of choosing a large 
p„ we allows for all possible orders and set p„ = n — 1. We consider two kernel choices. The 
first is K (t) = I (|t| < 1) which gives the Box Pierce statistic so that the corresponding tests 
are labelled BP. The second is a modified Parzen kernel 

l-6t2 + 6|x|3, |t|<l/2, 
2(l-|t|)3, 1/2 < |t| < 1, 

otherwise. 

Since k (1) = 0, we use the choice K (t) = k (t/2) /k (1/2) and label the corresponding tests 
as Par. 

The first experiment parallels Theorem [T] and aims to calibrate the penalty sequence. It 
analyzes the sensitivity of the test to the penalty term. It investigates the behavior of 
the test under the null for 7n = 7 (2 In In (n — 2))^^^ where the proportionality coefficient 7 
ranges from 2.8 to 3.8. The considered white noise is a directly observed {ut} with a standard 
normal distribution. The next table reports the simulated levels from 50, 000 replications and 
the percentage %{p* 7^ 1}, an important indicator to decide whether a difference between 
nominal and observed levels is due to p* or to the choice of critical values. 

[INSERT TABLE 1 HERE] 

In Table 1, a '*' indicates a statistically oversized test, i.e. with a level statistically greater 
than the nominal one at the 1% level. A threshold value for the BP test is 7 = 3.4 which 
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ensures that the observed sizes are close to the nominal for n = 1, 000. The Par test is 
slightly better behaved with this respect. Both tests have very similar % {p* 7^ 1} which is 
well below 1% for 7 = 3.4. The rest of the simulation experiments will use 7 = 3.4. 
Let us now introduce some benchmark tests. We shall compare our BP and Par tests with 



the data-driven test EL based on the statistic ELn in (4.4) with J„ = — 1 and the Lobato 



(2001) and Kuan and Lee (2006) critical values in (2.8). We also consider the Newey-West 



data-driven order Pimse used in Hong and Lee (2005) and the test statistic 



PiMSE={iyc'/'if))n'/', where c(/) 



i44E;:-(n-i)fc(j7p)j^fi|/n' 

0.539285 E;i'(„_i)fc(jyp)^j/^' 



J 



E%T'' U /pimse) {r]/9! - (1 - D } 
IMSE = 



2Y.]T'' k^{j/ Pimse) [l-iY^"^ 



where (■) is the Parzen kernel and is as (2.9). In the definition of Pimse, p is a pilot 



bandwidth set to p = (4ri/100)^/^^. Observe that c(/) remains potentially stochastic under 
the null so that the null limit distribution of IMSE may differ from the standard normal 
obtained by Hong (1996), Xiao and Wu (2011) and Shao (2011b) for deterministic p. We 
follow however common practice and the IMSE test will use standard normal critical values. 



The last benchmark test, CvM, is the Deo (2000) Cramer-von Mises statistic CvMn in (4.3) 
and uses the critical values tabulated in Anderson and Darling (1952). 

The first comparison under 1-Lq is based on i.i.d. {u^} with the following distributions: 
standard normal ('Nor' in Table 2), a Student with three degrees of freedom ('Stud'), and a 
centered chi square with one degree of freedom ('Chi'). The Student distribution is used to 
test the sensitivity of our test to the lack of higher-order moments and the chi square one 
can reveal sensitivity to skewness. 



[INSERT TABLE 2 HERE] 
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As in Table 1, the Par test is slightly better than the BP test but both behave well here. 
The highest % {p* 7^ 1} for the tests BP and Par are achieved the centered chi square 
distribution. BP and S are shghtly oversized under 'Chi' but still behave better than the 
CvM test which is their best competitor in this experiment. BP and Par seem to be not 
sensitive to the lack of higher moments as revealed from the 'Stud' experiment. The EL 
test is oversized due to a very high % {p* ^ 1} since % {j^el Inn} is also high. Escanciano 
and Lobato (2009) reports a similar behavior even when J„ remains finite. The IMSE test 
is conservative at the 10% level but has a level which seems quite far from the nominal size 
when a — 5% or 1%. This is due to the fact that Pimse remains moderate and quite close 
to 1 while the normal critical values of the IMSE test build on the fact that Pimse should 
theoretically diverge under Hq. The CvM test behaves well except for 'Chi' where it is more 
oversized than BP and Par for n — 200. 

The next experiment considers directly observed or estimated weak white noise {ut}. Two 
conditional heteroskedastic differences of martingales arc examined. The first process is a 
GARCH(1,1) with Ut = StCt and = o.OOl + 0.90s2_^ + 0.05ul_^ where the i.i.d. Ct are 
standard normal. This process, which puts a high weight on has been used in many 
simulation experiments, see Lobato et al. (2002) who justifies this choice with financial 
markets examples or Escanciano and Lobato (2009) among others. The second martingale 
difference is the ARCH(l) ut — StCt and sf = 0.001 + 0.9Uf_i with a dynamic of carried 
by Due to an ARCH coefficient larger than 1/3, E [m^] = 00 and the tests are, in 

principle, not expected to behave well in this experiment. The three next processes are 
uncorrelated but are not difference of martingales, so that the CvM test is not expected to 
have a correct size and is just reported here as a benchmark. The first, labelled 'Bifinear' 
in Table 3 below, is a bilinear model Ut — Ct + O-QCt-i'U't-2- The second, labelled 'No-MDS', 
is given by Ut — Ct-iCt-2 (1 + Ct-2 + Ct) and is from Lobato (2001). The third, 'All-Pass', 
is an All-Pass ARMA(1,1) process (Breidt, Davis, and Trindade, 1999) as in Lobato et al. 
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(2002), Ut — 0.5ut-i = (t — Ct-i/0-5 where the i.i.d. (t have a Student distribution with 
9 degrees of freedom. Since the root of the MA part is the inverse of the AR root, the 
resulting process is uncorrelated but the Ut are dependent due to non Gaussian Q- Finally, 
the last process 'ARRes' considers estimated residuals from the AR (1) yt = 0.8?/j_i + Q, 
Ut = Ut — 0yt-i, = Ylt=o ytUt+i/ Ylt=o V'i- "^^^ ^"^^ tests are all adjusted to 



tackle the estimation effect by using the critical values z* (a) of (2.11). The critical values 
of the tests IMSE and CfMare not adjusted so they are not expected to perform well for 
this case. 

[INSERT TABLE 3 HERE] 



The behavior of the BP and Par tests is very good with observed levels which are not 
oversized in general. This is due in part to a % {p* 7^ 1} which is always much smaller 
than 1%. However the BP and Par tests can be undersized as in the case of 'ARCH(l)'. 
But even in this case % {p* 7^ 1} remains very small suggesting that this is due to Lobato 
(2001) critical values. A not reported simulation study shows indeed that using instead 
standard Chi-squared values gives much better observed levels around 10%, 4.5% and 0.7% 
for the BP and Par tests, n = 200 or 1, 000. The behavior of the EL test is much more 
erratic, with observed levels which can be severely oversized as for 'Bilinear' or undersized 
see 'ARCH(l)'. This comes from a % {p* 7^ 1} which is much higher than for the BP and 
Par tests. The IMSE test can also be severely undersized or oversized especially at the 
nominal 5% level. The CvM test performs well except, as expected, for weak white noises 
or estimated residuals. 

We now consider "Hi. A first set of low lags alternatives will be calibrated using the 
Cramer von-Mises norm -D^^^/ = X]j=i ^"j/ (^^J^-^o) which is the counterpart of CvMn/n. 
We shall consider lacunary AR{P), ut = Out-p + St and MA (P), ut = et + Oet-p satisfying 
^CvM = 3/n for i.i.d. (0, 1) et- We shall select the positive AR and MA coefficients pp^n 
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and 9p^n with D^j^j^ = 3/n. It can be shown that this gives 

3l/2p 3l/2p 

PP,n = ^^i^ + oil)) and ^p,„ = ^^ (1 + 0(1)), 

for fixed P so that the resulting alternatives can also be viewed as local Pitman alternatives 
going to the null with the parametric rate n~^/^. For P = 1, n = 200 and 1, 000, we consider 
the AR{1) and MA{1) alternatives obtained for n = 200, with the view of seeing the impact 
of increasing the sample size on the power. We also consider larger values of P = 4 and 6 in 
which case we allow the alternative to vary with the sample size. This gives the six following 
alternatives: MAI, ut = €t + 0.12Uet-i for n = 200 and 1, 000 ; ARl, ut = 0.1233nt_i + et 
for n = 200 and 1, 000; MA4, Ut = et + 0.8165£t„4 for n = 200 and Ut = et + 0.2307ef_4 for 
n = 1, 000; AR6, Ut = 0.6849Mf_6 + et for n = 200 and Ut = 0.3242Mt_6 + et for n = 1, 000. 
In Tables 4, 5 and 6, p* and Sp* are the simulation mean and standard deviation of p*. Such 
statistics are useful to conjecture the impact on the power of p„ since large or p* and Sp* 
suggests that decreasing p„ can decrease the power. 

[INSERT TABLE 4 HERE] 

The low lags ARl and MAI experiments have very similar characteristics. The data-driven 
tests BP, Par and EL seem to be outperformed by the IMSE and CfM tests. This is 



actually due to the fact that the former use the robust critical values (2.11). Using chi- 
square critical values as in Table 5 shows that all the test perform similarly. The fact the 
the IMSE test seems more powerful at the nominal 1% level is not really meaningful since 
Table 2 reveals that IMSE is oversized at this level. 

[INSERT TABLE 5 HERE] 

For the higher order experiments MA4 and AR6, the BP, Par and EL tests behave very 
similarly and clearly outperform their competitors with a power close to 100%. An interesting 
fact is the very high values achieved by p* and Sp* for the BP and Par tests. This is due 
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to the selection procedure (2.10) which, compared to (4.4 ), penahze less large p. Note that 
such a behavior of p* prevents from using the selected order to estimate the order of the 
underlying process. 



The second set of alternatives are randomized small correlations processes as in (4.1) 



fc=i 



In this setting & = 1, 10, 000 is the simulation index. New MA coefficients {V'fc.fe} are drawn 
for each simulations. Randomizing the moving average coefficients allows us to explore 
various shapes of the correlation function. The noise {e^} is independent of the moving 
average coefficients {^k,h\ and is drawn randomly from the standard normal distribution. 
Since ^Yl^=\'^\h — P^^o^^) when P tends to infinity, the covariance structure of the 
alternatives (5.1) is described in Lemma [l] We consider two scenarios. In the experiment 



'LOW, P is set to 15 for n = 200 and to 75 when n = 1,000. The experiment 'HIGH' 
doubles the order P, P = 30 for n = 200 and P = 150 for n = 1, 000. The next table reports 
our simulation results. 

[INSERT TABLE 6 HERE] 

The BP test outperforms its competitors. The EL test achieves a similar performance only 
in the LOW experiment when P = 15 and n = 200. The Par test performs similarly to the 
BP test only for large P = 75, 150 and n = 1, 000, in which case it also outperforms EL, 
IMSE and CvM. The IMSE test performs poorly and is even dominated by CvM due to 
a piMSE which remains very close to 2. As well, the average value of pel is much lower than 
the ones achieved for the BP and Par tests. The high values of p*Bp and p*par niay suggest 

1/2 

that these tests would be affected by a lower choice of p^. However setting p„ = 3 {n/2) ' 
give similar conclusions for the BP test. 
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6. Concluding remarks 

The paper proposes an automatic test for the weak white noise null hypothesis when the 
variables are directly observed or estimated residuals. The test is based on a new data-driven 
selection procedure of the order used in a Box and Pierce (1970) test statistic. The critical 
region uses Lobato (2001) robust critical values when the variables are directly observed 
and Kuan and Lee (2006) ones for estimated residuals. An important theoretical finding is 
that the new test can consistently detect alternatives with small autocorrelation coefficients 
of order pn — o where n is the sample size, provided that the number of O (pn) 

autocorrelation coefficients at reasonably moderate lags remains large enough. The proposed 
test is shown to be adaptive rate-optimal against this class of alternatives. The paper 
gives examples of MA alternatives with small autocorrelation coefficients of order o 
which are detected by the new test but not by previous procedures proposed by Deo (2000), 
Escanciano and Lobato (2009) or Xiao and Wu (2011). These alternatives correspond to 
a plausible macroeconomic scenario where a temporary shock has no significant impact 
whereas permanent ones may cause some significant changes. A simulation experiment has 
shown that the new test can cope with various weak white noises including some ARCH 
or GARCH processes popular in empirical finance. The simulation experiment has also 
confirmed the good power properties of the test regarding detection of standard AR{1) and 
MA(l) alternatives as well as detection of small o (n~^/^) autocorrelation coefficients. The 
methodology considered here can be applied to many econometric problems which involves 
models with many parametric coefficients including inference for impulse response functions, 
VAR Causality testing, significance testing in series expansion or detection of potentially 
weak instruments. 
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Appendix A: Main assumptions 

In what follows, \\Z\\^ = EV« [|Z|"] where Z is a real r.v. and a a positive real number. 
When studying the performance of the test under the alternative, we consider a sequence 
{ut,n} of stationary alternatives with autocovariance coefficients {Rj^n}- This means that 
for each given n, the process {iit,„, t G N} is stationary. Note that {ut,n\ and {Rj,n} were 
abbreviated into {ut\ and {Rj} in the main body of the paper. We follow, under the null 
and the alternative, Xia and Wu (2011), Shao (2011b) and shall restrict ourselves to un- 
correlated stationary processes satisfying a Moment Contraction condition from Wu (2005). 
We shall assume that Ut^n = -Pn (• • • , <^t-i,^t) for some measurable F (•) and where the e^, 
t = — oo, . . . , +00, are i.i.d. (univariate or not) r.v. Consider an independent copy {e^} of 
{cj} and define ior t <t <n 

that is Cr is changed into e'^ in ul^^. The magnitude of the differences Ut — is a measure 
of the process sensitivity to shocks on the past innovations. More formally assume that for 
some a > and for all j > 

U't,n — ''A^Wa — (j) ' where 5a (j) decreases to when j — >■ oo. 

Shao (2011b) assumes that Sa{j) decreases with an exponential rate, a condition which 
is fulfilled by many linear or nonlinear time series models, including threshold, stochastic 
volatility, bilinear or GARCH models, see Shao (2011b), Wu (2005, 2007, 2009) and the 
references therein. Our main assumptions are given below. 

Assumption K. The kernel function K (■) from to [0, oo) is nonincreasing , hounded 
away from on [0, 1/2] and continuous differentiable over its support [0, 1]. 



26 



Assumption P. The maximal order diverges faster than some power withp^ = o^n^^^"^^^^^^^^^] 
as n ^ oo, where a > 1 is as Assumption\^ below. The penalty sequence 7„ satisfies 7„ > 0, 
7„ — )■ oo and 'jn = o (^n^^^) as n ^ oo. 

1/2 

Assumption R. Under T-Lq and l-ii, sup^ Il'"t,n|li2a ^ ^o-Rq n f^'^ some a > 1 and, for some 
b> 0, 6i2a (j) < Clj"'"^ Moreover I/C2 < Ro,n < C2, anc? maxje[i,p^] „/ Var „) < 

Cs. 

Assumption M. The processes {ut^n}, the model and the estimators c-"^^ such that: 

(i) There is a sequence {9n}, with On = 6q for all n under T-Lq, such that 

(^M -On) , ^ {ut,nUt-l,n - E [Ut,nUt-lA) 1 , S G [0, l] I (A.l) 

D[Q I] -converges in distribution to a Brownian Motion with a full rank matrix; (ii) The resid- 
ual function admit a second order expansion Ut (6) = Mt_„+(6'— + ~ ^n)' n ~ ^n) + 
^t,n (0) where, for any C > 0, 




sup sup \X:t,n 

«e[l,n] 6»;||e-e„j|<Cn-V2 



(A.2) 



and, for each n, {-u^ „, „, -uj ^} is a stationary process with¥}/'^ [ll^^tll ] — C*4, {at} being suc- 
cessively ^uf^^ , I^S.^iMj^i'}, |Mt,„MS5j|, 



E 



(1) 



< 



C5 anc? supjg^E 
C7 and supjg^ E 



(1) 



< Cg, sup .g^E 



(1) 2 



n 



-1/2 



Y^n / (1 



(1) 2 



< 



The compact sets [0,1/2] and [0,1] in Assumption K are somehow arbitrary and can be 



replaced by any nested compact intervals. Note however that Assumption K forbids the use 
of the Daniell Kernel K (x) = sin (x) /x due to the nonincreasingness and bounded support 
conditions. 

Assumption [R[-(i) assumes a polynomial decay for the coefficients 5i2a (j), a condition which 
is weaker than the exponential rate assumed in Shao (2011b). Note also that, in Assumption 
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P| the order of can come closer to n^/^ when a increases. Under Assumption |Rj-(i), 
{ut^n} must have finite moments of order twelve at least. This is mostly needed for a proof 
of Theorem [l] based on Lindeberg Substitution Method, see Pollard (2002, p. 179), which 



requires to bound moments like E 



t "'t+j 



Y < E [u}^] . However our simulation experiments 



suggest that the test (2.3) is well behaved with a low %{p 1} even when {ut} has a fat 
tail Student distribution or no fourth moments as in the ARCH(l) experiment. It is possible 
that a proof for Theorem [l] which would better use the self normalization of Sp/R^ can work 
under better moment conditions as moderate deviation, which is at the core of our proof, 
does not request existence of higher moments under self normalization (see e.g. de la Pena, 
Lai and Shao, 2009). 



Assumption M is a shortened version of Assumptions Bl and A2 of Kuan and Lee (2006) 



who uses a standard linear expansion n^^"^ (0 — On 



n 



'1/2 



XltLi + Op (1) to show that 



(A.l) satisfies a Functional Central Limit Theorem (FCLT) as requested in M-(i). This 



FCLT is mostly used under "Hq to show that (2.4) holds. Note that the full rank FCLT 



condition in Assumption M -(i) can be quite restrictive. For a correctly specified AR{1) model 
Xt — OXt^i = Ut, this rules out for instance 9 = 0. However such an issue can be addressed 
when an additional test statistic T with proper critical values t (a) is available, as the ones 
proposed by Francq et al. (2005), or by Delgado and Velasco (2010, Theorem 3) which gives 
a general approach to obtain test statistics which are not affected by parameter estimation. 
Indeed, because P (p = 1) — ?■ 1 under Hq as shown in Theorem[l| setting z (a) = Si+T~t (a) 
gives lim„^oo f(Sp>z (a)) = hm„^oo P f > ^ («)) = lim„^oo P (T > t (a)) = a. When 



{ut} is directly observed. Assumption M amounts to Assumption 1 in Lobato (2001) and 
the FCLT for n""*^/^ ^1""^] (ufUt_i — E [n^n^.i]) is a consequence of [^(i) and Wu (2007). 

(2) 

Assumption M is easily checked for simple linear models and OLS estimation where ui „ and 
Xt^n can be set to 0, see also Francq et al. (2005), Hong (1996) and Shao (2011b). 
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Appendix B: proofs of main results 



This section contains the proofs of the results of Section 3. C and C are constants that 
may vary from hne to hne but only depend on the constants of the assumptions. Notation 
[■] is used for the integer part of a real number and a V = max {a,b), a A b = min (a, b). 



Let u 



u 



t,n 



be a copy of ut = Fn , et-i, ej) obtained by changing et-j, ^t-j-i, ■ ■ ■ into 



Then the condition — •'ll < 5a (j) ensures that 



\Ut - 



''\l<Qa{j) where 9, (j) = 5a (j) • 



(B.i; 



We first state some intermediary results that are used in the proofs of our main results. 
These intermediary results are proven in Appendix C. Lemma |B.l gives the order of stan- 



dardization terms E{p), E^{p) and V/^{p). Propositions B.I and B.2 deal with the impact of 



the estimation of 9. Proposition |B.3| is used to study the asymptotic null behavior of the test 
and to show that P (p = 1) — t- 1 in Theorem [l} Proposition B.3 deals with directly observed 
or estimated residuals thanks to Propositions B.l| and B.2 Propositions B.4| and B.5 are the 
key tools for our consistency result, Theorem [2j They dealt with directly observed variables 
but are combined with Propositions B.I and |B.2 to deal with estimation errors in the proof 
of Theorem [2l 

Lemma B.I. Suppose Assumption\l^holds and thatp^/n < 1/2. (i) There exists a constant 
C > 1 such that, for q = 1,2 and for any 1 < p < p^, ^ < J2j=l (l - iV ^^'^ (j) < 



Cp, § < EU^'' [ij ^ ^P' ^a(p) < Cp, and E^{p) < [K' [^) - K^j)J < 

Cp^^^Va{p); (ii) Under Assumption^ for all n and all p G Va{p) > C{p — 1)^^'^ and 

Ea{p) > 0. 



Lemma B.2. Suppose Assumptions M and\Mhold. Then the statistics and associated critical 



values iSi,ZL{a)], ( 5*^,^2 (a)), iSi,'zL{a)] and iSl,zl{a)] satisfy (2.4) that is give an 



asymptotic a level test. Moreover, under Tii, zl {a), z1 {a), (a) o-nd ^ (a) are all Of (1) 



Lemma B.3. Under Assumption 



R 



suPo<j<„_iVar [RA <^ 



Proposition B.l. Suppose Assumptions 



Op {n , maXpe[o,„_i] n YI'j=i ( - Rj) = Of (1), and 



m 

1 



1^ and^ 



hold. Then max, 



■i6[0,p„ 



Rj - Rj 



max 

ie[o,n-i] 



max 



Rj ^j^n 



11 3 \ |~2 2 

max 1 h"; — ^1 « I 



log 72 \ 

logn\ 

log 72 \ 
72 / 



max |r,- — r,-„ | = O 

i6[0,p„ 



J J," I 



log 72 



72 



l/2^ 



Proposition B.2. Let Assumptions [k| [m| anc? i? /20W. Lei Sp he as in (2.5). Then 

Sp — Sij — {Sp — Si 



max 

pe[2,p„] 



1/2 



Op(l) 



an 



d for any p„ = 0(72i/2), Sp„ - = Op 1 + ( 72 ZT=i 



1/2 



Proposition B.3. Suppose Assumptions K, \^ \^ and\^ hold and that Hq is true. Then 



(3.1) ensures that 



jSp - Si)/Rl - E^jp) ^ 
lim P max — > 7n I =0. 

Proposition B.4. Under Assumptions^^ anc?[^ there are some C,C' > such that for 
72 large enough and uniformly in p E 



E 



p/2 



-Rl^Eip)>CnJ2Rln-C'Rl 



3 

Proposition B.5. Under Assumptions \^ [P| and[!ffl, there is a constant C > such that for 
n large enough and uniformly in p & [l,p^], 



B.l. Proof of Theorem [T| ( |3.2[ ), (3.1) and Proposition B. 3 give that lim„_j.oo P(p 7^ 1) = 0. 
Hence 5*^ = Si+op (1) and Lemma B.2, which ensures that the retained critical value satisfies 
(2.4), yield that the test (2.3) is asymptotically of level a. □ 



B.2. Proof of Theorem [2| The definition (2.2) of p gives, for any p G 



Sp = arg max Sp - R^E (p) - 7ni?o^A (p) + Rt>E (p) + IuRoVa 
>Sp-RlE {p)-^nRlV^{p). 



Since the critical value z (a) in (2.3) is bounded under 1-Li by Lemma B.2, it is sufficient to 



find a p„ G such that Sp^ - R^E - 7„-Ro^a (Pn) +00. Let Pn = 2P„ where P„ 



is as in (3.4). Set 



The detection condition (3.4) gives 



2 1/2 

>Pn)= nNnpn > > OO, 



21/2 



(B.2) 



with a constant k^, which can be chosen as large as needed. Lemmas B.l B.3 , Assumption [P] 
which ensures Pn = o (n^^^) and 7n = o (^n^^^), and Proposition B.l 



for the case of estimated 



residuals yield that 



= Sp^ + Op (1 + v}/^Ro,nTln) - RlnE (Pn) ' 7ni?0,n^A (Pn) + Of 



Pn + lnP: 
1/2 



1/2- 



n 



> Sp„ + Op (1 + n'/^Ro,nn„) - Rl^E - C^nRlnVli 



1/2 



Now the Chebycheff inequality, Propositions B.4 and B.5, give 



Op (Vari/2 is A) > Rl^E + C Rl^nUl + Op [pH^ + n^/^Un) 



Hence substituting gives, since nTZ^ — )■ oo by (B.2), 



Sp^ - RlE {pr,) - inRlV^ (Pn) > C R^nTZl (1 + op (1)) - C^nRiyJ' (1 + op (1)) 



Since Assumption R ensures that i?Q„ stays bounded away from 0, (B.2) gives that Sp^ — 
RqE (pn) — 7n-Ro^A (Pn) +00 as requested provided > C /C . □ 



B.3. Proof of Theorem |3| Consider first the null hypothesis. As seen from the proof of 
Theorem [T| it suffices to show that 

limP max / ^ '/ ^^^^ > 7« I = 0, 



a statement which implies that p* = 1 + op (1) so that Lemma B.2 implies that the conclusion 
of Theorem [l] holds for the test based upon Si,. Since |-Rj,n| < Il'^^t,n|l2 H""*," ~" ^t^''||2 ^'^'^ 



E 



u 



t,n 



U 



t-j,n 



Ro,n + E [{Ut,n - UtJ) {Ut,n + u\j) ul_.j^^ , 



(B.l) shows 



\Tl^-Rl^\<C\\utAlQ2{3)<C3 
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(B.3) 





for all j > 1. Now Lemmas B.l and B.3 Assumptions K P]and[Rl and Proposition B.l give 



max 

pe[2,p„] 



{S* - SI) - {Sp - Si)/Rl 



VAip) 



< C max 

pe[i,p,J 



P 



.1/2 



< C max ^ i 

P6[l,pJ pV2 j^ \R^ 



f2 r2 



■0,n 



< CnfJ^Oi 



'logn 



n 



3/2^ 



■(l) + Op|$^- 



Op(l). 



Hence (3.1) and Proposition B.3 



P I max 



{s;-si)-E^{j>) 



> 7n 



,P6[2,pJ I^aIp) 



< P 



{ max (^--^y^V^-(^) > (l + ^) (21nln.)vA + 
\^pe[2,Pj Va(p) - V 27 ^ ^ y ' 



=0(1), 



which gives the desired result under T-Lq. 

Consider now Theorem [2] and T-Li. Define 



3=1 



'2 / l\ 3 



Let Pn be as in (3.4) and define Pn = 2P„ and 7^„ as in the proof of Theorem 2 Then 



Assumptions [K] and [R[ Propositions |B.1| and |B.2 



< 



cuy: 



— Sp^ Sp^ 



Hence, for directly observed or estimated residuals, 



S. 



Pn 



1 + 



logn 



n 



l/2^ 



The proof now follows the steps of the one of Theorem[2]based on the order above. Proposition 

< CiPn + nTZl). Hence, since Pn = 



B.4 



and 



B.5 



and Lemma 



B.3 



which gives E 



Pn 



o ( (logn/ri)^^^ 



S*^ = arg max \s;-E{p)- 7„\/a (p)] + E (p*) + -f^V^ {p* 



>s;^-E{pn)-c^^p 



1/2 



1 + Op 



logn 



n 



l/2^ 



E 



Pn 



+ Vari/2 {sl))-E{pr,)-C^r.pT 



CRl^nUl - C^nRlnVT + Op (pII' + n^'^Un + (^^) (pn + mil) 

C'RlnUl (1 + Op (1)) - C^nRlnVT (1 + Op (1)) ^ +00 



provided large enough. 



□ 



B.4. Proof of Theorem |4| We first introduce a set of alternatives. Let / (■) denote the 
spectral density of a centered Gaussian stationary process {ut\ .with covariance coefficients 
Rj. Define a Holder class of processes as 

(oo 
{Mi}: 1/3 < inf / (A) < sup / (A) < 3, sup |/'(A)|<L, y^\Rj\ < 
-^G[-7r,7r] ^^y_^^^-^ ^j-^ 

The next Lemma describes a family of alternatives which satisfies Assumption |R] uniformly 
for prescribed constants and a given 5a (j) ■ 

Lemma B.4. Consider a centered stationary Gaussian process {ut} with spectral density 
function f (A) = exp {g (A)) / (27r), where 



■1,0,1. 



(B.4) 



//p > 1 (iiT'd p > are such that p^p < e < 1/6 then there is some constant L > 0, inde- 
pendent of e, p, p and b = {bk,k G such that (i) \Rq — 1| < 6pe and \Rj — phj\ < 6pe 
for j E [l,p]; (a) \Rj\ < 3p (2e)^ for all j in [ip + 1, {i + 1) p) and all i > 1; (Hi) {ut} is in 
Hdlder{L); (iv) Suppose that p^ = p^(p) = (2 log log n)"*^^^ / (np^/^) for some Kn > and 
bounded away from infinity, and that p E [1,-Pn] with Pn = o ^(^'^Z ('^nloglog^)^^^^ ^ ■ 
Then the associated family of processes {ut {b,p) ;b E { — l,0,lY,pE [1,P„]} satisfies As- 
sumption^^ for any a > and a 6a (j) = O • 



Proof of Lemma 



B.4 



Rewrite g as g (A) = pJ2k=~p^kGW (ikX), bo = 0, bk = b. 



Since exp (x) = X]m=o •^™/'"^' uniformly over any compact set and maxA \g (A)| < 2pp < 2e < 
1/3, we have 



R, = r exp {-ijX) f{X)d\ = ^Y.— r exp {-ijX) {g (A))'" dX. 
For m > 0, since f^^ exp {—ijX) dX = 27r if j = and if j 7^ 0, 
exp(-zjA) {g {X))"" dX 

X bk^ / exp {i{ki + . . . + kn,- j) A) dX 



(B.5) 



1 

2^ 



P_ 
27r 



m 



p" 5Z &fci X ■ ■ ■ X bk„ 

{kl,...,km)eKm(j) 



(B.6) 



where Km is the set of m-tuples with entries in [— \ {0} so that i^Km = (2p)™ and 



Km (j) contains m-tuples in Km for which ki -\- ■ ■ ■ -\- km = j so that ^Km{j) < (2p) 



m— 1 



Proof of (i). Part (i) is a consequence of (B.5), (B.6) and inequality 2pp < 2e < 



00 (2ppr-^ ^ 

m=2 m! — 



1 which together imply that for j E [0,p], |-Rj — I (j = 0) — p6j | < p^ 
2PP' E^=o < 2ep6 < 6pe. 

Proof of (ii) . Let £p+l < J > + 1) p- Observe that Km (j) is an empty set when m < £. 



Hence it follows from (B.5) and (B.6) that |-Rj| < 2^ ^ 



2n ^m=i+l m' 



I exp i-ijX) {g (A))™ dX 



< 



r Z^m=£+1 m\ 



< p(2e)^e. 



Proof of (in). Observe that \g (A)| < 2pp < 2e < 1/3 and that therefore 



1/3 < 1 - 1/3 < exp (-1/3) < / (A) < exp (1/3) < e < 3 



for all A G [— 7r, tt] . 



Parts (i), (ii) and 0<p<e< 1/6, pp<l/6 yield that, for L large enough, 

oo p oo {l+l)p oo 

^\Rj\ < + ^\Rj\ + < l + 6pe+(l + 6e)pp + 3^(£+l)pp(2e)^ 

i=o j=i 1=1 j=ip+i 1=1 

oo 

<l + l + l + l + ^(£+l) {2eY < L. 
e=i 

Since /'(A) = ^' (A) / (A) with ^' (A) = -2pELi ^fe^^in (A;A), we have sup,g[_,,,] |/'(A)| < 
3 X 2p^p < 1. 

Proof of (iv). Let Ut = St + Yl'jLi^'j^t-j be the Wold decomposition of the process. 
Brillinger (2001) and /T^log/ (A) exp (ijX) dX/2n = pbj gives 

I-^ exp (p Y^l^^ hk exp {-ikX)) exp {ijX) d\ 



Var (st) 



2tt 



exp (p Yll=i bk exp {-ikX)) d\ 



exp j p bk exp {—ikX) j d\ 



k=i 



Arguing as in (i) and (ii) with an expansion as in (B.5) give Var (et) = 1, — pbj\ < Cpe 
for j G [l,p] and {ipjl < Cp(2e)^ for all j G [£p + 1, (£ + l)p) and all i > 1. Gaussianity, 
the choice of p in (iv) with the restriction on P„ and Wu (2005) give, for any a > 1, 
Si2a (j) < Ca < C'aj~'^~^^'^. That the other conditions of Assumption [r| hold uniformly 
in p G [1, Pn] follows from (i) and (ii). □ 
We will now define a family of correlated Gaussian alternatives. We first introduce 
some notation. Consider 7„ = (21nln?T,)^''^ and V = {2-', j = 1, . . . , J„}, 2-^" = _P„ = 
o ^ ('^/Tn)^^^^^ SO that C [l,p„] for n large enough. Define also 



2 

p^(p)=2^, p„(p) = 2p^(p) e„ = P^p„(P„) 



(7. 



^1/2^ p7/4 



1/2 



n 



K P 



oil). 



(B.7) 



Since p^Pniv) < for all p & V, en plays the role of the real number e of Lemma B.4 and 
we assume from now on that n is so large that e„ < 1/6. Consider the following log-spectral 
density functions: 

g{X;b,p) = 2pn{p) bkCos{kX), b = {h, . . . ,bpj e peV. 

fce[p,2p) 



Functions g are of the form specified in (B.4). Let be a symmetric standard Brownian 



motion process. Consider a centered stationary Gaussian processes 



Ut,n {b,p) 



exp ( ^l^iM ) exp (ttX) dW (A) 



Observe that (0,p)} does not depend on p and is a Gaussian white noise process with 
variance L Let {Rj^n {b,p)} denote the covariance function of {ut^n {b,p)}. The family J^n of 
Gaussian processes can now be defined as 



= {K„ {b,p)},be {-1, 1}^" ,per']. 



Lemma B.4 implies that all sequences {wt,™} in J^n satisfies Assumption^] and that cHolder(L). 



We now study the asymptotic behavior of the stochastic covariance sequence {Rj^n {B, P)}- 



Let Nn {b,p) be as in (3.3), that is 



Nn{b,p) = Nn{{Ut,n {b,p)},p,Pn {p)) = 



Rj,n (b, p) 



Ro,n {b,p) 



> pn{p) , j e [l,p] 



Lemma B.4-(i,ii) and ( |B.7 ) gives that Nn {b,p) = p/2 for n large enough and uniformly in 
p = 2^ E V, so that Pn{p) = 2/t^7„/ (np^^"^) = K^7nP^^^/ (nNn ib,p)). Hence the sequences 
{ut^n} in J^n satisfies condition (i) in Theorem |4| Therefore the Theorem will be proved 
if we show that supj.^ minjuj^jgjr^ P (T„ = 0) < a + o (1), where sup^;^ is a supremum over 
asymptotically a-level tests. Since the equivalence result of Golubev et al. (2010) holds over 
J-'n cHolder(L) this is equivalent to show that supj^^^ min^u^y^jr^^ Q (T„ = 0) < a + o (1), Q 
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being the distribution of the continuous time regression model 



dUn (A; b,p) = g (A; b, p) dX + 27r'/^^^^, A G [-n, tt] , 



where W (■) is a Brownian motion over A G [— 7r,7r]. This can be done as in Spokoiny (1996, 
Proof of Theorem 2.3) by bounding sup^^^^ min{;7,jgjr^^ Q (T„ = 0) with a Bayes risk, based on 
the choice of a uniform distribution for p and a Bernoulh one for b. □ 

B.5. Proof of Lemma jlj The first approximation i?o,n = cr^ (^1 + O (^^nPn^^ / follows 



easily from the definition (4.1) of the alternative. To show that the second approximation is 



valid, note that for j = 1, P„ 



2 



1/2 / 1/2 \ 2 

By the Cauchy-Schwarz inequality, lipj+iipi + ■ ■ ■ + ipp^i/jp^^j\ < J2k=i'^k — 0{Pn) for all 
j = 1, Pn, hence, uniformly in j = 1, P„, 

1/2 / d1/2\ 1/2 / 1/2 
-Rin = VTjWiO- + U = -rrW^a + O 

n^/^P^/' ' \ J n^^'Pn" \ny^Pl/' 

since P„ = o((n/7„)2/3). □ 



B.6. Proof of Proposition [T| Let us now check consistency of the test (2.3) under the 



assumption that min^gjip^] l^/'fccr^l > 1. Define p„ = {u/2)'yn'^ / [n^^'^Pn^^- Lemma jlj 
imphes that = P„ (1 + o(l)) for such a p„, which therefore satisfies 

p„ = (1 + o (1)) (Z./2) {inP'JVNnf" /n'/\ 



so that (3.4) asymptotically holds provided z/ > 3k* and the test is consistent if 1 < Pn < 



p„/2 by Theorem |2] provided the considered alternatives satisfies Assumption [R[ Wu (2005) 



gives that the alternative (4.1) satisfies for any a > 0, 



1/2 



h2a (j) < Ca — - — Yil l^^il j ^ ' ^i2a (j) = for all J > P„. 
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Hence the condition Pn = O (^(n/'^n)^^^^^ gives that 6i2a (j) < Cj ^ since the \cripj\ are 
bounded away from infinity. Moreover Gaussianity ensures that 



2 \ / l/2pl/4' 

Ik.. -^.\U,.<C..\^yjPl] = O ( ^;2!L_^ I = o (1) 



fc=i 



1/2 



which gives Var = o"^ + o (1) and maxjg[i „] Var^ (wt,n) / Var {ut^nUt+j,n) = 1 + o (1) so 
that Assumption [R| holds. This ends the proof of Proposition [I]- (i). 

Consider now the other tests in Proposition l-(ii). Define Ri j = Yl^=i ''^t,nUt+j,n/n, 
Ro,j = Et=l ^t£t+j/n, = YTtZi uiy^^^J {n - ])-nRl J {n - j) and = YTtZl ^t^t+j/ 
riRl j/ {n — j). Define also rjt = rjt n = i^J^'kLi'^k^t-k, setting ^/^^ = for A; > P„, so that 
ut,n = £t + In^Vt/ i^n^^^Pn'^^ ■ We have 



Rj — Rqj 



< 



1/2 
7n 



t=i 



1/2 
7n 



3/2py4 



n 



t=l 



+ 



In 



n^Pn 



1/2 



n-j 

t=i 



The Burkholder inequality gives, for any a > 1, 



7 



1/2 n-j 



3/2 pl/4 X] 
-r n t=l 



7n^^ - j) 



< c 



1/2 



< C 



l/2pl/4 



1/2 n-j 

2^ (^t^t+i 



7 



t=l 

1/2 n-j / oo 



1/2 n-j /j-1 



< 



t=l \fc=j+l 



< C 



-,1/2 pl/4 



n 



1/2 n-j 



< C 



1/2 
7n 



7n 



n^Pn^^ 



t=l 



< 



7n 



nPn 



1/2 



< C 



■y P 



1/2 



n 
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for all j. Note also that 
enough, since jnPn^'^ = oln^^"^) 



J2^=i VtVt+j ^ X]"=i Vt the Markov inequality give for a large 



max LRi — Rq 

j6[l,n] I 



Op max \Ri j — Rq 

\i6[l,n.] I 



0=1 
Op I n 



7^ 



1/2 n-i 



3/2pl/4 X] 



l/2pl/4\'^ 



+ 



n-j 

t=l 
1/2^ " 



7« 



1/2 



E 



n 



1 1 



n 



7a/S-l ^3a/4 



1 



(n log n 



.a/2 



Hence 



max 



Ri,j - Ro,j 



{n logn)^^^ 

Arguing similarly for the r|j give, since Jn = O (n^/^) 



max I r? , — I 



, , , max kg . - a^l = Of 



'log^/^ 



n 



1/2 



(B.8) 



(B.9) 



where the latter is from Proposition B.l Note that (B.8) and (B.9) gives (4.5). Let W} 



k,ni 



CvMk^n, ELk^n be the statistic computed under G^, A; = 0, 1, i.e. with Roj/tqj and Rij/tij. 

give 



Note that (B.8) and (B.9) gives Wi^n = W^o,n + op (1). (4.5) and Proposition B.l 



\CvM,,^ - O.Mo,„| <-J2 4^ 



i=i 



< 2 max 

ie[i,J„] 



+ max n 



n^l^R, 



^0, 



X max 

je[i,J„] 



n 



1/2 ( -'^■i.i 



Rij R\ 



^0, 



i=i 



logn\ 



l/2^ 



72 """/^Op 



1 



(n log n 



,1/2 



1 



nlogn 



Op (1) 



13 



Hence CvM^^n = CvMo,n + op (!)• For EL^, VTi,™ = Wo,n + op (1) and Xiao and Wu (2011) 
gives that maxjg[i_j^] Rk,j/^k,j ^ (2 In n)^^"^ (1 + op (1)) for = 0, 1 so that P = In n) — 



1 under Go and Gi.We now show that P {p*el = 1) — > 1 under Go- Propositions B.4 and B.5 



( |B.9D give 

P Ri^L 7^ 1) = P max >lnnl +0(1) 

= P (1 + Op (1)) max -^"^ — > Inn + o (1) 
y pe[2,j„] p-1 ' 

P I ^ _ — > - Inn for some p G [2, J„] ) + o (1) 




E 



p — 1 

"E,"^2(fi§,j/'^"-E[«§../^1) 

p-i 

^(iln^-^E?=2(l-j7^) 



> - In — 
2 



p — 1 



+ 0(1) 



< 



G 



log n 



o(l) = o(l). 



Now, observe that Proposition B.l and (4.5) give 

bpL-bp* bp* -bp* 



max 

pe[2,j„] 



p — 1 



p — 1 



< max 



o(l) 



p — 1 



< 2 max 



1/2 Rqj 



logn 



X max 

P6[2,J„] 



n 



1/2 



l/2^ 



n^'^^op 



Rqj _ 
1 



+ I max 

pe[2,Jn] 



n 



[n log n 



,1/2 



+ nop 



nlogn 



1/2 { 



Op (1) . 



Ri,j 



This, since arguing as in the bound above gives maxpg^^j^j {^BBq ^ — BPq ^j / {p — 1) 
Op ^log^^^ , implies that maXp^[2,j„] (^BP^ p — BP^ ^ / {p — 1) < log n with a probability 
tending to 1 and then P (p^^. = 1) — > 1 under Gi. Hence (4.5) gives that ELi n = BP^ ^ + 
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Op (1) = BPq I + Of (1) = ELo^n + Op (1), so that ELn converges in distribution to a Chi 
square one with one degree of freedom under Go and Gi. □ 

Appendix C: Proofs of intermediary results 

The proofs also use the notion of cumulants, see for example Brillinger (2001, p. 19) or 
Xiao and Wu (2011) for a definition. Let 

Cum (mj, . . . = r„(ti, . . .,tg) 

stands for the gth cumulants of {ut^n}- The next theorem on cumulant summability is 
Theorem 21 in Xiao and Wu (2011). These authors do not formally consider sequences 
{^i,n} but the following result is a straightforward extension of Xiao and Wu (2011). 

Theorem C.l (Shao and Wu (2011)). Suppose is stationary for each n, with 

oo 

sup ||Mt,„||g_^i < OO and sup \\ut,n - ufn\\q < (j) where ^f~'^5g (j) < oo. 

j=0 

Then there is a C which only depends on sup„ ||tit,n||^4.i and Ylf=oj'^~'^^q U) such that 

oo 

|r„(o,t2,...,gi <c. 

In what follows, we drop subscript n in expressions like -Rj,^, r„ (■) and On when 
there is no ambiguity. We denote 

(. V n-l 
- ) - (j) and KM = Yl ^^P- 



c.l. Proof of Lemma B.l[ (i) The first three bounds of the lemma follow directly from 



Assumption K which implies that K (j/p) > K (j) for all j and 1(2; G [0, l/2])/C < 
K'^'^{x) < CI{x G [0, 1]) for some C > 0. The Cauchy-Schwarz inequality implies that for any 
p G [l,n/2], E^ip) = J2p^{l-i)K,, < KM < P^" {Y.Uk^,{p)f" < Cp'^'V^ip), 
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which is the last bound in (i). (ii) Write p = l + u. Since p <Pn < n/2, the support of K {■) 
is [0, 1] and K (■) is a decreasing function, we have 

The map z/ i — > {2 + zu) / (1 + z/), z G [0,1), is decreasing. Hence, for z/ > 2, V^{j)) > 
jjJ^/^K^ (^) dz > C{p- 1). Now Vl{2) > 2 [K^ (i) - (1))^ > gives the desired 
result for Va{p)- Since K is nonincreasing, p i — i- Ea{p) is non decreasing and E/^{p) > 
for all peV. □ 



C.2. Proof of Lemma B.2, Under Tio, The proof repeats the steps of Lobato (2001) and 



Kuan and Lee (2006) using the joint FCLT of Assumption M The joint FCLT of Assumption 

□ 



M gives that the critical values are Op (1) under Hi 



C.3. Proof of Lemma B.3, Equation (5.3.21) in Priestley (1981) and Theorem C.l gives 
uniformly in j, 



Var LR 



n-j-l 



E ^ 



ii=-n+J+l 
2n 



bii +j 



n 

+ 00 



{Rl + Rn+jRn-j + r (0, ji, J, ji + j)) 



,j2,j3,j4j 



£^jE4 + i^ E ir(o. 

jl=-2n i2,j3j4=-oo 

^ OO ^ +00 



j=0 



J2 j3,j4=-00 
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C.4. Proof of Proposition B.l For the sake of brevity we assume that 9 is unidimen- 
sional. That 

'logri\ 



max 

jG[0,n-l] 



1 



J 



n 



R 



max I 1 - - ) - r? , 

ie[o,n-i] V n I ^ ' 



n J 
logn \ ^^'^ 



n 



follow from Xiao and Wu (2011, Theorem 2). Note that these authors do not consider 
stationary sequences but their arguments carry over under Assumption [R| Hence it 

suffices to study maXjgjQ p^] \Rj — R 
Assumption |p| We then now show that 
so that 

n-j n-j n~j 



and maXje[o,j5„] {r^ 



max, 



rJl since Pn/n = o [n ^/^) under 
Rj — Rj = Op (n"-*^/^). Let et = ut — Ut, 



Rj 



-^iut + et) {ut+j + ct+j) = i?j + - ^ {utCt+j + etUt+j) + - ^ etCt+j 
^ t=i ^ t=i ^ t=i 



with, by the Cauchy Schwarz inequality, 
tion Ml for Xt = ''^t (9 



Ylt=i ^tet+j \ /n < X]r=i /"^ ^^"^5 under Assump- 



t=i t=i t=i t=i 



Now, observe that Assumption M gives 6 — 6 = Of [n ^/^), maxig[i „] |rt| = op (1/?^) and 
i Ee? . 3 (.^- i t (.<■')% ^ i t {^^ . 1 1 ftl - O. (i) , 

t=l t=l t=l t=l ^ ^ 



max 



t=l 



< 



2maxtg[i_„] fct 



n-j 



n 



t=i 



This gives, uniformly in j G [1,""-] 



Rj - Rj\ <\0-d\ E 



(1) I (1)' 



+ 



e-e 



,(1) 



n 



(C.2) 
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It also follows from Assumption 



M andp„ = o (n-*^/^) that 



(1) w\ (1) I (1) 



e-e 



max 



E 



(1) , (1) 
(1) 



Op(l), and for (j) = WtW^- + 



0-^ 



max 

ie[o,p„] 



Op 



Of max 
?^ / \ ie[o,p,J 



t=i 



Op 



1/2 



n-1 



2 / 1 



j=0 



t=i 

n-l 



1 / " ^ 

Op(i)-Op Ve 



1 



7=0 



n 



t=l 



Op (1) —Op n max E 
n \ ie[o,n] 



Op(l) 



This gives maXje[o,p^] 



Rj - Rj 



Op {n ^/2) and maXp6[o,n-i] n Y.%i [Rj - Rj) = Op (1). 



The study of maXjg[o,p„] ~ TjI is similar. 



□ 



C.5. Proof of Proposition B.2 For the sake of brevity we assume that 9 is unidimen- 
sional. Since R]—R] = (Rj — Rj) +2Rj (Rj — Rj], Proposition 



B.2 



is a direct consequence 



of Proposition B.l and Lemma C.l below 



Lemma C.l. Assume that Assumptions K, and[^ hold. Then 



nEU {K\j/p) - K\j)) R, i?, - R, 
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andnj:]ZlK\j/p^)R,(^R,-R,>j = Op (^(l + n E^:, '^'^ for any pr. = 0{n^'^) . 



Proof of Lemma C.l We just prove the first equality since the proof of the second is very 
similar. Define = E LrJ = (1 ~ We have 



n-l 



n 



^ KjpRj {Rj - Rj 



Cn{p) 
Dn{p) 



< Cn{p) + Dn{p), where 



n— 1 



n 



n 



^ KjpRj [Rj - i?j 

n-l 

^ (-Rj - ^j) (Rj - R'j 



The Cauchy-Schwarz inequality and Assumption |K] gives 

1/2 / p 



Hence Proposition 
Assumptions 



K 



M 



B.l 



1/2 



/ \ 1/2 

yields that maxpg[2,p„] |C„(p)/ (^n X;5'=i I = Op (1). For 



(C.2) and ti = Xt [9] give 



max Dn{p) < Op(n ^/^) ( max Din{p) + max D2n{p) ) + Op(n ^) max -D3n(p) 

pG[2,p„] Vpe[2,|5„] pe[2,P„] y pe[2,P„] 



\ t=l 



2 maxtg[i,„] \tt\ 



n 



\ut\ max D^n{p), 
^ I pe[2,p„i 



where -Di„(p) = ?^ZlLi -^j " -^j F 



(1) , (1) 



J=l 
P ^ 

^3n(p) = LRj - Rj 



i=i 
p 



t=l 

n-j 



(1) , (1) 



t=i 



(2) 



i=i 
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By Assumption K and M and by Lemma B.3 we have 



E 



max Din{p) 



E 



(1) , (1) 



E 



max D^nip) 

PG[2,p„] 



X 



E^/2 



,(1) 



E 



max -D3n(p) 
pe[2,p„] 



<Cnf]Vari/2 j^^.j ^1/2 



(2) 



1/2 



E 



max D4n(p) 

P6[2,p„] 



j=i i=i 
The Markov inequahty gives us the stochastic orders of magnitude of the four maxima in the 

bound for maXpgp^p^j -D„(p). Since p„ = O (n^/^) by Assumption [p| maxtg[i „] ftt] = op (l/n) 



and n ^ J2t=i = Op{n ^) by Assumption M, we have maxpg[2,p^] |D„(p)| = Op (^1 + ^ 

/ \ 1/2 

Op(l). This together with maxpgp^p^j |C„(p)/ ('T'X]j=i-^j) I = Op(l) shows that the 
Lemma is proved. □ 



C.6. Proof of Proposition |B.3[ The proof of Proposition |B.3| is long and divided in three 
setps. In the two first steps, we focus on directly observed residuals. In the first step, we 
approximate the sample covariance Rj by a martingale counterpart X]"=i Djt/n, j G 



as in Shao (2011b), see the notations below and Lemmas C.2, C.3 and C.4 The second 
step deals with the deviation probability of 

n Et%+i Djt) ' {K' U/P) - (1)) - a'E^ jp) 

ct^Va (p) 
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which is approximated with some Gaussian counterparts through the Lindeberg technique, 
see Lemma C.5[ The third step concludes and exphcitely deals with the case of estimated 



residuals thanks to Propositions B.l and B.2 



Let us now introduce additional notations. Let J-fc be the sigma field generated by 
Cfc, Cfc-i, . . .. Define Pt [Z] = E[Z \J^t] -E[Z \J^t-i]- Wu (2007, Proposition 3) establishes 
that ||Pj [lit+fc]!!^ < 6a (k) and Shao (2011b) has shown that 



||P0 [UkUk-j]t < 2 hkha iS2a (k) + S^a {k ~ j) I (j < k)) 



(C.3) 



which is smaller than 4 HiifcUga ^2a {k — j) when j < k. Define now the vector of martingale 
difference Dt = [Du, . . . , Dp^t]' with 

oo 

Djt = ^Pt [ukUk-j] 

k=t 

which converges a.s. and satisfies E [Djt \ J^t~i] = 0, max^ E [l^^jtl"] < oo, provided ||Mt||2a < 
oo and YlT=o '^2a (k) < oo. Consider the martingale Mj = Mjn = XltLj+i ^jt which is an 
approximation of Rj. Shao (Lemma A.l, 2011b) gives under Assumption r| and for any 
a G [1, 6a], 



E" 



t=j+i 



< c. 



(C.4) 



We shall also use a p-dependent version of Dt, denoted Dl , with entries 



E [Djt \et, et-p+i] = YP't [ukUk-j] , where 



(C.5) 



k=t 



P't [Z] = P*-^+i [Z] =E[Z \et, et.,+i] -E[Z |e,_i, . . . , e^.p+i] 



Arguing as in Shao (2011b, Lemma A.2-(iii)) gives 



(C.6) 
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C.6.1. Martingale approximation and preliminary lemmas. An important property of Dt and 
Dl''+' is as follows. 



Lemma C.2. Suppose Assumption an(i[^ hold. Let Kjp be as in (C.l). Then for any 



p < p, t, and any s <t — p, 
Proof of Lemma IC.2[ We have 



i=i 



K D D*~''+' 



3a 



3a 



< 



+ 



+ 



+ 



We have for (C.7) 



(C.7) 



p oo 




oo 

E P* ['^t+kiUt+ki-j 


j=l fci=0 




k2=0 


P j-l 










E P* [Ut+k2Ut+k2-3 


j=l fci=0 




k2=0 






oo 


E^.pEp^ 




E P* [Ut+k2Ut+k2-j 


j=l A:i=0 




k2=j 


p oo 






E^.pEp^ 


Us+ki^s+ki-j] 


E P* ['^t+k2Ut+k2-j 


j=l fcl=j 




k2=0 


p oo 

E^.pEp^ 


Us+kiUs+ki-j] 


oo 

E P* {'^t+k2Ut+k2-j 


j=l fcl=j 




k2=j 


p-1 




p-1 



3a 



3a 



3a 



3a 



3a 



(C.7) 
(C.8) 
(C.9) 
(C.IO) 



Ut+k2~jP't [Ut+k2 

j=l ki=0 k2=0 

p-1 p-1 / p-1 \ 

E E I E KjpUs+k,-jUt+k2-j 1 Ps [Us+k^] P't [Ut+k2] 
fcl=0fc2=0 \i=A:iVfe2 / 3„ 



3a 



p— 1 p— 1 

SEE 

fci=0/c2=0 



p-1 



^ ^ '^jp^s+ki~j^t+k2—j 
j=kiVk2 



Sl2a (kl) Sl2a (h) 
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using ||P^ [Mt+fe]||i2a ^ IIP* Nt+fc2]lli2a = ^i2a (h) ■ Now (\CA^ and the Burkholder inequality 
give 



p-i 

Kjpus+ki-jut+k2-j 

j=fciVA;2 



< 



6a 



p-1 

KjpDt+k2-j,t-s+k2-ki 

j=fclVfc2 



6a 



+ 



p-1 



iiTjp {Us-\.ki-jUt+k2-j ~ Dt+k2-j,t-s+k2-ki) 

j=kiVk2 



6a 



Hence (C.7) is smaller than Cp^^'^. For (C.8), we have since {us+ki-j,j [^5^1]} 
{P[ [ut+k2Ut+k2-j] ,j e [1, ki] ,k2>0} are independent, 



(C.8) 



p— 1 00 / p— 1 

I ^jp'^s+ki-jP't [ut+k2+jUt+k2] I Ps [' 

fci =0^2=0 \j=fcl 



s+fciJ 



3a 



p— 1 00 
A;i=0fc2=0 



p-1 



-^ip^s+fei-jPt Nt+fc2+j'^tH 



-k2\ 



Sea {h) ■ 



6a 



Let dt = YlT=t^t[uk] be the martingale difference approximation of Ut, see Wu (2007). 
Now, since {us+fci-j, 4+fci-i,j e [l,^i]} and {P[ [ut+k2Ut+k2^j] J ^ [1, fci] , /c2 > 0} are inde- 



pendent, arguing as in the proof of Theorem 1 in Wu (2007), (C.4) and the Burkholder 
inequality give 



p-1 



KjpUs+ki-jP't ['^t+k2+jUt^ 



-k2\ 



j=ki 



< 2 



< C 



6a 



p-1 

y^ Kjpdg^ki-jPt [^t+k2+j'^t+k2 
i=^^ 6a 
p-1 

5^ Kjpd1^k,-j (P* Nt+fc2+iWt+fc2])^ 



p-1 



-ft'jp (ws+fc^.j — (if) P'j ['Ui+fcj+i'^t+te 



+ C ||P; [Ut+k2+jUt+k2]\\la < CkiSl^ (fcs) • 



3a 



Hence Assumption iRj gives (|C.8|)< Xlfciio X]fe=o ^I'^L (^2) '56a (A^i) < C. 
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For (C.9), observe first that (C.4) gives 

oo p—1 p 



(C.9) 



fcl=0fc2=0 j = l 



oo p— 1 p 
ki=0 k2=0 j=k2 



1 6a 



» \ p—1 p 

2'^t+k2- 

\ki=0 / k2=0 j=k2 



Since u\j^j,,^_j is independent of et, . . . , et_p+i and [wi+fca] 



+k2-j\ Ilea 



< 



E [M*+fc^_^.Pt [ut+k2] let, . . . , ej_p+i] 







6a 



Substituting gives tliat ^< C ECi E^=fc, 0i2a (A^s - j) 5i2a (A:2) < C. 
For dClol , ( |a3| and (|aTT| give 



p / oo 



3a 



+ ||E [{ut+k2-j - u\+k2-j) K+fca] let, • ■ ■ , et-p+i] llg^ 

< \\ut+k2-j - u\+k2-j\\i2a IIP* K+te]lll2a < ©ISa (^^2 " j) 5l2a (h) ■ (C.ll) 



2-jJ II 6a 



p / oo 



i=i \ki=j / fc2=i 



Hence substituting gives 



3a 



□ 



We now define a suitable sequence of Gaussian vector. Let 2p„ < d < 3p^ be an integer 
number. Consider a sequence of independent centered Gaussian vectors rjt = [rju, . . . 
witli 

E[r/,,,r^,,,]=E[D™D™]. (C.12) 
We sliall also assume that {rjt} and {et} are independent. 
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Lemma C.3. Let {r]t} be as in (C.12) and suppose Assumption holds, 
p e andt,s G [l,n], 



Then for all 



Pn 



^ I Gov ir]j^t,'nj2t)\ <C and^ |Var {rjjt) - ct'^\ <C, 
ii7^i2e[i,p„] i=i 



1/2 



Proof of Lemma C.3 (C.4) gives for all ji, j2 



Gov (-Djit, -Djai) = 1™ CIo"^ 



Er=ji+i ^t^t-ii J2t=h+i '^t^t- 



oo 

J2 \ 



/ ■ \l/2 ' / ■ \l/2 I 

(^-^i) \n-22)' ) k=-oo 



see also Lemma A. 2 in Shao (2011b), provided Xlfcl-oo 1^ ['"o^ii'Wfc'Wfc+i2]l < oo as shown 



below. (C.6) and (C.12) give 



max 

iij2e[o,p„] 



Gov (r/j^i, r/j2t) - ^ E [uoUj^ 



UkUk- 



'32 i 



<Ce,2aiPn)- (C.13) 



Now relation between cumulants and moments in Brillinger (2001) and Theorem C.l gives 



absolute summability of the 4th moments. Hence Qi2a iPn) = ^(Pn^) gives the first bound 
of the Lemma. For the second and the third bound, observe that under the null 



E [uoUjUkUk+j] - (J- 



fc=— oo 



< |E [uluf^ - E [ul\ E [uf^ I + 2 



E [uQUjUkUk+j] 



k=l 
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|E [ulu'j] - E [ul] E I < CQi2a (j) = O and absolute summability of the 4th mo- 
ments gives the second bound. This also gives the fourth one since 

1/2 



2 \ 1/2 

J \ r^2 N ^4\2 



ir,, (Var iv.t) - 



< C. 



For the last one, observe first that 



^ I Gov [jij^t, Vj2t) f < { I '^^Ji*' I I < °° 

i<ii<i2<p„ \i<ii<i2<p„ 



by Theorem C.l since the 2th cumulants are the covariance. This gives, for any z 



[zi, ■ ■ ■ , Zp^] , 



Var [z'r]) = z'E [r]r]'] z < ^ Var {r]jt) z^ + 2 ^ | Gov {r]j^t, r]j^t) \ \ I I Zj^ \ 

i=i i<ji<i2<p„ 

1/2 / \ 1/2 



\ i<ii<i2<p 

n 

< Cz'z. 



E 

i<ii<i2<p 

n 



2 2 

^ii^i2 



Hence Var {Ylj^iKjpDjsrijt l-D^ j < C (^Zlj=i -^Ip-^i^) since {A} and {r/J are indepen- 



dent. 



□ 



C.6.2. r/ie deviation probability of the maximum of Proposition B.3. The proof is based on a 
smooth approximation of the maximum of real numbers Xi, . . . , Xp^. Consider an increasing 
and three times continuously different iable real function / with 



lim / (x) = 1, f {x) = X for x >2, max sup l/*-*^ < oo. 

2=1,2,3 X 



(C.14) 



26 

Let e = Cn oo with In {pj /e ^ o(l). Then maxp6[i,p^] {/ (xp)} < (^Ep=i {xp)j < 
pI/^ maxpg[i,p^] {/ (xp)} gives that 

^r(a;,)j = (^l + 0(^^)) majc^{/(a;,)}. (C.15) 
We will first find a suitable approximation for the distribution of 

M-{f:r{sp)^ ^hereSp^n±K,p^'^y, Sp^^^^^. (C.16) 

Define, for 77 = [r/i, . . . , %^]' and x G [0, 1], 

s=j+l s=t+l 

n Yf^=i KjpRl {x; rf) - a^E^{p) 
Spt{x;v)^ ^^y^ ^^^^ , ^t{x;v) ^ f {spt{x;v)) : 

_ 1 

J2^{x;v)j , Mtiv) ^Mt il;v), (C.17) 



and 



.(1) _ dspt {x; ri) _ 2 Ej=i Kjp (Es4+i -^^'^ + + ^^=t+i 

.(2) . . _ ^) _ 2 Ej=l Kjplf^ 

(x;77)=/« (s,,(x;r?))4;) (xjTy), 

(x; 7?) = {spt {x; v)) (4;^ (x; r?)) V /(^) ( ^ (^; ^)) 4? (^5 ^) ' 

S^? {x; V) = /^'^ {x; V)) (4i^ ' + 3/^'^ {spt {x; r/)) {x; r/) 4? {x; r/) . 



We first bound the moments of t}^^ {x;7]), Ti^^ {x;?]) and sg-* {x;7j) when 77 is set to Dt or 
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Lemma C.4. Under Assumption^ and if = O (n-*^/^), we have uniformly in p E 
X G [0, 1] and t = 1, . . . ,n, 



max 



I- 3a 3a J 



c 



3a J 



max 



4? 



3a/2 



< 



Cp'/^ 



max 



a J 



3a/2 J n 

Cp^ 

3/2 ■ 



n 



(C.18) 
(C.19) 

(C.20) 



Proof of Lemma C.4 (C.14) gives 



< c 

< C 



Spt (x; T]) , {x; r]) < C ^4*^ ^)) + 4? ^) 

4:'(^;.)|((4i'(^;.))V|4?(^;.) 



(C.21) 



(C.21) shows that the lemma directly follows from 



max 



3a 



^(1) 
^pt 



.x;vt) 



3aJ 



max 



3a/2 



^(2) 



x; Vt) 



^ Cp^ 

3a/2 ) ~ n 



(C.22) 
(C.23) 



( C.23 ) directly follow from the triangular inequality. For ( C.22 ) , we first bound 



^pt [x] 



3a 



We have 



^pt {x] 



< C 



3a 



np 



1/2 



3a 



+ c 





+ c 


l^s=t+l 






3a 





np 



1/2 



(C.24) 



(C.25) 



3a 
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We have, for the first item (C.24) 



(C.24) < 



np 



1/2 



< 



< 



np 



1/2 



np 



1/2 



3a 



3a 



3a 



np 



1/2 



3a 



1 



np 



1/2 



C7pl/2pl/2 



t-1 



s=t—p+i 



6a 



where p > p and by the Burkholder inequality. Now let Djt = -D*^ ^^'^ be as in (C.5). Since 



Yyj=i ^jpDjsDjt is a difference of martingale given et, . . . , ef_p+i, (C.6), the Burkholder and 



triangular inequalities, Lemma C.2 give 



np 



1/2 



< 



spt-p spp K D D 



1/2 



1/2 



3a ' i=l 

2 \ 1/2 



np 

- npV2 1 



t-p 

s=l 



6a 



6a 



3 = 1 



jt 



c 



06a (P - P) 



1/2 



3a / 



,06a (P -P) 



np 



1/2 



P 



1/2 



P 



1 , 06a (P-P) 



?7, 



1/2 



P 



1/2 



Hence substituting gives 



I^Li (5^i=i ^ip^jsDjt 



np 



1/2 



< C 



n 



1/2 



P^V^^06a(p-p) 



n 



p 



1/2 



(C.26) 



3a 
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For the first item in (C.25), (C.23) gives a bound C jn^l"^ . For the second item in (C.25) 
conditional Gaussianity of the | X]j=i ^jp^jtVjs | and Lemma 



C.3 



give 



np 



1/2 



3a 



< 



c 



< 



np 
C 



1/2 



n / p 



.s=t+l \j=l 



n p 



1/2 



1/2 



< 



c 



np 



1/2 



3a 



n / p 



s=t+l \j=l 



1/2 



3a/2 



np 



1/2 



'i*ll3a 



<^((n-t)p)^/'< ^ 



^s=f+l j=l 



1/2 



n 



1/2- 



Substituting the two last bounds in (C.25) and (C.26) in (C.24) shows that 



max 



3a 



3a 



n 



1/2 



p 



1/2 



(C.27) 



Observe that Oga (p — p) < C* (p — p) ^^^^ by Assumption |r[ Consider now 



p = max 2p, 



n 



p 



>2p, 



which is such that, since p E with p^ = O (n^/^), 



If ( - r > 



1/2 



1 1 

< - < < 



n 



n n 



5/6 — ^1/2 



26 



1/2^ 



n n 



1/2- 



Hence (C.27) gives (C.22). 



□ 



Let / (■) be a three times differentiable real function and define for Aitiv) in (C.17) 

dll {x; 7]) 



d^x 



J = 1,2. 



Observe that I (M) = I (Mn {Dn)) = In (Dn), It {Dt) = It+i (vt+i), and that / {Mi (r^i)) 
= Xi {rji) is a function of the Gaussian vectors rji, . . . ,rin only. 
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Lemma C. 5. Let Ai and M.i{rii) he as in (C. 16) and (C. 11). Consider a real function I {■) 
which may depend on n and three times continuously dijferentiable with maxj=i 2,3 sup^ x) < 
C. Then under Assumptionslfi \r\ and if e = O {pl/^'^"'^) , 



\E[I{M)-I{Mi{vi))]\<C 



Pn 



1+3/a ^ 



^1/2 ,f^l-l/a 



Pn 



Proof of Lemma C.5, The proof of the Lemma works by changing D„ into rjn, -D„_i into 
rjn-i and so on, the so called Lindeberg technique described in Pollard (2002, p. 179). This 
amounts to decompose / {Ai) — I {Ain {Vn)) into the following sum of differences. 



nM)-nMn{nn)) 

= In (Dn) - In-1 {Dn-l) + Xn-l {Dn-l) - Xn-2 {Dn-2) + " " " + Xi {Di) - Xi (r^i) 
= In (Dn) - Xn (Vn) + Xn-l (Dn-l) - X„_i {Vn-l) H h Xi (Di) - Xi {r]i) . 

Since Xt{ri) = Xt{l; rj) and X((0; rj) = Xt{0), a third-order Taylor expansion around rj = with 
integral remainder gives 



X{D,)~XM] = E Xf)(0; A) -Xr{0;Vt) 



-(1)/ 



+ -E 
2 



xf)(0;A)-Xr(0;r^,) + - {1 - x)'E X^' {x; D,) - Xr{x;vt) 



dx. 



Since [Dt] is a sequence of martingale difference, E Xj^''(0; Dt) — xf^''(0; rjt) 
expression of X^^^ (0;^]) given above. Hence 



due to the 



\E[I{M)]-E[I{Mi{r^m< 



X^E[xf (0;A)-Xf)(0;r/,) 



t=i 



^(3), 



+ 2 i^-^f\Y.\^Vt'\^:Dt)-Xr>{x-r^t) 



t=i 



dx. 



(C.28) 
(C.29) 



We now compute the differentials If {x;r]), j = 1, 2, 3. We have 
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I^'^ {x;7i) = I' {Mt {x;v))Ml'^ {x;rj), 

{x; rj) = I" {Mt {x; r/)) (a^ « {x; r/)) ' + I' {Mt {x; r/)) Mf^ {x; r/) , 
Xf ) {x; 77) = {Mt {x; r])) (a4 ^ {x; n)) ' + 3l" {Mt {x; 77)) « {x; 77) A^f (x; 77) 
+ /' {Mt (x;77))A^f (a:; 77). 



We compute the differentials of Mt- We have 



\p=i / p=i 

= A^^^ (x; 77) E;-i (x; 77) E^i^ {x- 77) , 

^ (x; 77) = (x; 77) + M^2t ^) + (x; 77) , 
M^'^ {x- 7]) = Ml? (x; 7^) + . . . + A^g) (x; 7/) , 
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where, dropping the variables x, r] for notational convenience 



'■5t 



p=l 



-2 fy^iiy 



p=i 

Mg' = 3 (i - l) Ml-"- E-'E™ X: EJr'E 

V / p=i p=i 

^ - 1 (e - 1) -Mr^^ E Y: (s 

^ p=i p=i 

All? = (3e - 1) M'r E 



^1^(2) 

-'pt ) 



2 r 



p=i 



p=i 



2 

pt ; ) 



-lv(3) 
pi • 



p=l 



The third-order item(C.29). Since 



t=i 



E 



33 



it is sufficient to bound Ylt=i ^ '^t^\^) independently of x where I^'^^x) stands for 



(3), 



If^\x] r]t) or If'^'{x] Dt). We have, dropping dependence w.r.t. to x for ease of notation, 



r(3) 



n 

t=l 



(3) 



< 



t=l ^ 

n ( 



Ml 



(1) 



+ E 



+ E 



t=i 



M 



(3) 
3t 



We now study the ten items above. 



(1) ELiE 



(1) 



We have for a, a > 1 with 1/a — l — l/a, 



E 



M 



(1) 



E 



p=i 



< E E 

Pl,P2,P3 = l 



< max 

p,t 



^pt 



Pn 



3a 



J2 ev« 



P1,P2,P3=1 



A-f3(l-e),pe-lve-lve-l 
•'^It ^pit ^P2t ^P3t 



< 



n 



C 

3/2 



E E 



l/ci 



Pl,P2,P3=l 



»^3(l-e) v-ie-1 v-ie-lv-ie-l 

•'^h ^pit ^P2t ^P3t 
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by (C.18) for all x e [0,1]. Now, since t t'^/", t ^ t^"^/^ are concave and Y7p=i^l < 



(X]p=i '^p) ■> the definition of M.t gives 



^ ^l/a 
Pl,P2 ,P3 = 1 



A /< 3(1— e)y,e-lyie-lyie— 1 



^3 - " W}'" 

P1,P2 ,P3 = 1 



.Pl,P2,P3 = l 



» ^3(l-e)y,e-lyie-lyie-l 



3a(l— e)r-iae(l— 1/e) Y-iae(l — 1/e) Y-iae(l— 1/e) 
^Pit ^P2t ^P3t 



1/a 



pi 



-3a(l-l/e) 



-3(l-l/e) 



Pn 



p=l 



ae(l-l/e) 



,p=l 



ae 
pt 



. Pn 



3(l-l/e)- 



p=l 



ae 
pt 



1/a 



1/a 



< p3(l-V^)+3/{e^) < ^p3/a^ 



uniformly w.r.t. to t since (lnp„) /e = o(l). Hence for all x G [0, 1] 

n 

t=i 



(1) 



< C 



p: 



;3/a 



n 



1/2- 



(C.30) 



(2) ELiE 



We have, since A^i > 1, 



E 



M 



(1) 



< CE 



< CE 



7W 



2-3e 



Pn 


3" 






Pn 


3" 


/_^^pt ^pt 




< CE 


_yy^3-3e 


2_^^pt ^pt 




p=2 








p=2 





(1) 



for all t, such that X]"=i E 



-Ml 



(1) 



nt 



. Hence a bound similar 



to (C.30) holds. 
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(3) Er=iE 



. Let a > 1 be such that 1/a = 1 — 1/a. Arguing as for (1) 



with (C.18) and (C.19) 



E 



(1) A^(2) 



M\'M 



It 



Pn 

< C ^ E 

Pl,P2 = l 



M 



2(l-e) 



ye— lyie— lyi(l) y (2) 



< C max 

p,t 



3a 



.(2) 
-•pt 



3a/2 



Pn 



Pn 



J2 E^/^ 

Pl,P2 = l 

2a(l-l/e) 



A^2(l-e)^e-lve-l 



j^l/2 



J^ea{l-l/e) 

,p=l 



^1/2 

^ ^3/2 ^Pn^Pn " ^ ^3/2 " 



Hence, uniformly w.r.t. x G [0, 1], 



1 

Pn 



^ p=i 

Pn 

p=i 



ea(l-l/e) 
pi 



eS{l-l/e) 
pt 



n 

Ee[| 



< C 



-|(l+4/a) 



1/2 



(C.31) 



(4) ELiE 



. Proceeding as (1) and (3) gives, since infp t > 1, 



E 



<Ce ^ E 

Pl,P2 = l 



M 



2{l-e) 



-'pit ^P2t ^Plt \ ^P2t 



provided e = Hence E"=iE 



can be bounded as in (C.30). 



(5) ELiE A^i^ 



,(3) 



CE 



7W 



3(l-e) 



Z^p=2 ^pt ^pt 



can be bounded as in (C.30) since M.t > 1 gives E 

3" 



nt 



< 



(6) EJLi^ -^2? • Arguing as in (3) gives that ^^=1 ^ 



'■2t 



can be bounded as 



in dOMj ). 
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(7) Er=iE 



'■St 



. Arguing as in (4) shows that this item is neghgible compared to 



(C.30). 



(8) ELiE 



M 



Let a > 1 be such that 1/a = 1 — 1/ a. We have, since infp ^ S^j > 1, 



E 



M 



< CeE 



p=i 



<ceY,E'/-^\{M'rm'y 



.(2) 
-'pi 



P=Po 



3a/2 



-(1) 
-•pt 



3a 



3/2 — ^3/2 ' 



provided e = O (pi/"")- This gives a bound similar to (C.31 ) for X]"=i ^ 



(9) ELiE 



(10) Er=iE 



can be bounded as in (C.30) provided e = 0{p. 



.l/(2a)^ 



M 



can be bounded as in (C.31). 



Hence, collecting the dominant bounds (C.30) and (C.31) in (I)-(IO) gives 



[1-X 







n 



E|xf)(x;A)-Xr(x;r/i 



(3), 



dx<C 



Pn + Pn 



-1+ 



72 



1/2 



< C 



(C.32) 



(1') 

The second-order term (C.28). Note that X]: (0; r^) = rj'Atr] where At depends 
upon Di, . . . , Dt-i and r^t+i, In the standard Lindeberg method, {Z^j, t G [1, n]} and 

{r^f,^ G [1,''^]} are both independent variables with identitical mean and variance, so that 
the second order term, which writes as a sum of items E [D'tAtDt\ — E [//{A^r/t], is equal to 
in this simpler case. However this does not hold in our case. In this step, the second order 
term is dealt with by removing from x'f'\{^] rj) a block Yl^=i ^jp Yll~^t-i ^js ^^"^ changing 
the D^t into D'r/+^ = E [Djt \et, et^e+i]- 

Observe that xf ^(0; r]) = xj^ (0; rj) +I^\0; v) v) +x2H0; r]) with, dropping the 

dependence upon and r], 



37 



t(2) _ 



(1) A/<l-2e 



1 1 lin'M, 



pt 



r(i) 



I' {Mt) , 



p=l 



p=l 



Pn 



1^(1) 



Observe A^t (0; Dt) = Mt (0; fjt) and E^t (0; D^) = Sp^ (0; r^t) and that these quantities do 
not depend upon rjt or Df. We shall first focus on l[f . Let £ > 2p„ be an integer number. 
Define, for y e [0, 1] , 



©pt (2/) = ©pt (i/; yDt + ii-y) A*"'""') , 



, "^pt (y) = Ipt (y; y A + (i - y) , 
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which are such that &pt (1;^) 
also 



4i\0;r/), 6,,(1) = 4i\0; A), '^pt{l) = 4*(0; A)- Define 



t-i-l t-l n 

s=j+l s=t-t s=t+l 

nY.UK,,-R%{y)-a'E^{p) 



Mjt (y) 



n 



Spt (y) = 



-, '^pt{y) ^ f{spt{y)), 



Sj? (y; v) = f^'^ (spt (y)) %pt (y; v) + /^^^ (s,, (y)) (6,, (y; 77))^ , 
(y) = £ff (y;yA + (l-y)A*-'+^), 



and the counterpart of x'^^ (0; 7]t) and X^j'' (0; Dt) as 



(2) 



a* (y; 77) = I ^ - 1 j (y) 971,^-^^ (y) (^XJ ^^^^ 4'*^ ^) ) ' 



(y) = a, (y;yA + (i-y)A*"'^')- 



Observe that xjf (0; rjt) = (1; 77*) and xjf (0; A) 
Ept(l)-at(l;77t)] and 



= (1). Hence E xf^ (0; A) - ^^'tt (0; ^t) 



r(2) 



E 



Xj? (0; A) - X}^^ (0; 77,) = E (0) - 3t (0; r;,)] 



.(2) 



+ / E 
'0 



3^l\y)-3t'{y-r^^) dy, 



(1) 



(C.33) 
(C.34) 



where ^ (y) = da* (y) /dy and 3^'^ (y; rjt) = dJ* (y; r^t) /dy. 



(1) 
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We first consider the integral item E 3[^^ (y) 
jS;) iv). Define 



dy from (C.34) and first compute 



^ de,t {y) ^ 2 (El=l. ^.s) {yP^ + (1 - z/) 

dy na^V^{p) 



+ 



na^VA (p) 

na^A (p) 

'P TV /r / N r-, 



(y) = 


dipt (y) _ 
dy 


4? (y) = 


dspt (y) _ ^ 
dy 


iy) = 


rfS^i^ (y) _ 
dy 


iy) - 


_ rfS^? (y) 
dy 



na^VA (p) 

f^'^ i^pt (y)) si^^ (y) Gpt (y) + (s,* (y)) ©g^ (y) 



f'^ i^pt (y)) 4'^ (y) 'ipt (y) + f^'^ {spt (y)) 'lH^ (y) 



+ f^'^ (V (y)) sj,i^ (y) (6p, {y)f + 2f^'^ (s,, (y)) 6,, (y) (^) ^ 



aS?(i/) = /"(9H,(i/)) 



and 



:iS:^ (y) 



Pn 



Pn 



1 ) (y) mr'^ (y) Yl ^li' (y) ^Pt (y) E ^p^' (^) (^) ' 

\p=i / p=i 

I - 2] a« (y) m]-''^ (y) ^7' (y) iy)) E ^7' (y) sff (y) , 
^ \p=i / p=i 

{y) = 2 Q - (e - 1) aS:,) (y) ojt,^-^^ (y) (y) s« (y) j s;-2 (y) (s« (y) 

{y) = 2(^-1) ^ {y) ^l'^ {y) f E ^li' (y) {y)] f E ^It' (y) ^^''^ (y) 



.p=i 



.p=i 
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To bound the moments of Sp^-* (y), Spi'^'' (y) and 12^]-' (y), consider first \\& 

and 



^(1,1) 



6? (y) 



and 



4'^ (y) 



3a 



For \\&pt 



3a 



3a 



^pt {y) ; (|C.18|), the Burkholder inequahty, (|C.6|) 



3a 



p„ = O (nV2) , 2p„ < £ < 3p„ and Bga - pj < Cp^^ give 



16 



Pt Ky)\\3a 



< 



3a 



2|l-Z/lE 



1^: 



\s=j+l 



^72 + -+ (- 



1/2 



e6a(^-pj < 



c 



\^jt-^jt I 



1/2- 



©it^ (y) 



3a 



< 



na^VA (p) 



3a 



t-i 



s=t-t 



l^Ji-'^jt Ilea 



6a 



t-1 



^ ncT'VA (p) 

^1/2 ^l/2pl/2 



s=j+i s=t-e s=t+i 



I^J*"^it Ilea 



<c\ _ + ^^e6a(^-pJ+ - e6a(£-p„ 

n n \ n 



Pn 



1/2 



n 



{npn. 



1/2 / ' 



11^, 



pt Ky)\\3a 



< c 



Pn 



1/2 



n 



2: {y) 



< 



c 



3a nPn 



For 



(y) 



3a 



(C.18), p„ = O (n^/^) and the Burkholder inequality give 



sff (y) 



3a 

t-1 p 



2 \^ \^ 

s^=t-l j=l ^ 

2 Sj=i -^ip (X]s=t-£ ^js) {J2s=t+i Vjs) 



< 



+ 







( E ^^^0 ^^^^ 


+ 


\s2=i+i / 


3a 



2 X]j=l -^JP (X]s=t-£ ^js) 



na^VA (p) 



3a 



3a 



tict^Va (p) 
^ K f ^^^^^ 



2 \ 1/2 



3a / 



\Kjp\ 
^ na^VA (p) 



t-i 



s=t-e 



6 



+ C 



1/2 



1/2 



3a 



ni/2 n 



1/2 



n 



1/2 



These bounds and (C.14) give, uniformly in y, p and t, 

C 



4'^ (y) 



< 

3a 



sii^ (?/) 



< C7 

3a \ n 



Pn 



1/2 



3a/2 



n \ n 



n 



Now, arguing as for the study of (C.29), e = O [pH"') give uniformly in p, t and y 

::l/2+3/a 



E 



,(1) 



,(1) 



< c 



Pn 



n- 



3/2 



E 



< c 



l+3/a 



n 



3/2 



It then follows YJU lo ^ N'^ (2/)! dy < Cpl+^yn^/'. Since Er=i /o ^ N'^ (^^ 



satisfies a similar bound, we have for (|C.34|) 

n 



t=i 



E 



(y) - 31'' {y; Vt) 



(1) 



< C 



„l+3/a 

"n 

^1/2 • 
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Consider now (|C.33|). Since and r]t are independent of the Z'in (0), Wl]''^^ (0) and 



(1) 



Tipt (0), we have using (C.12) 



E[3t {0)-3t {0;vt)] 



n 

p 



1 (0) 971,1-^'= (0) 



Pl P2 



E (0) S^/ (0) / (S-/ (0)) / (S-/ (0)) i: i: (E - E [r^,,,r^,,,]) 

Pl,P2 = l jl = lj2 = l 



^jlPl ( Ssi=ji + 1 -^jlSl + '^Si=t-e''ljlSl ) -^i2P2 (Ss2=i2 + 1 -^i2S2 + Ss2=t-£^i 



2^2 



nVV^-l/A (Pi) 



n 



Hence (jOSS]) and (jaMj) give 

n 

5^E[xi? (0;A)-Xi? (0;r/,) 



< C 



„l+3/a 
"n 

nV2 ■ 



To study 



E 



4f (0; A) - X^i^ (0; r^t) 



(2) 



, observe that, uniformly with respect to p, t and 



max 



4? (y) . ' ^Pt (y^ vt) 



max 



^(2,1) 



3a/2 



;(2) 



1/2 



3a/2 / n 



^(2,1) 



< c 



Pn 



n 



3/2 



Arguing as for ELi ^ 4'^ (0; A) - Xj^^ (0; r^^) gives ELi ^ X^,^^ (0; A) - X^^^ (0; r^^) 



r(2) 



.(2) 



K2) 



< 



C ( + 2^ ) , and provided e = O {tj^^"^) 



X^Efe? (0;A)-Xg) (0;r/,) 



It then follows 



X^Efcf (0;A)-Xl? (0;r^,) 



< C 



„l+3/a 
fn 



n 

J]E[xf (0;A)-Xf) (0;r/,) 



^l+3/a 1 

< c ( ^^^ + 



^1/2 ^1-1/a 



(C.35) 



Substituting (C.32), (C.35) in (C.29), (C.28) shows that the Lemma is proved. 
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C.6.3. End of the proof of Proposition B.3. The rest of the proof is divided in 3 steps. 



Step 1: Martingale approximation. Let Sp and Sp be as in (2.5) and (C.16) respec- 
tively. Let a = 4a/3. The Cauchy Schwarz inequality gives 



~ I " f 1 
j=i \ 



t=j+l 



X 



nV2 

1/2 



UtUt-j 

t=j+l 



< 



c (t I i^U^' - t 1 it I f^'.. + t 

\j=l \ t=j+l / j \o=\ \ t=j+l 



1/2 



UtUt^j 



Hence 



Sp Sp 



a/2 



7 = 1 \ t=j + l J / \i = l \ t=j + l 



2\ 2 



Observe now that (C.4) gives 



< 



\i=i V t=j+i 



2\ 2 



UtUt- 



n ^ 



i=i 



t=j+i 



1/2 



< C 



n 
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Since the Burkholder inequality and maxj E < oo give max^gji^p^] E-*^/" [|Mj„|°] < 

Cn^/^, we also have 



E« 



< 



, i=i \ t=j+i 



2\ 2 



t=j+l 



1/2 



< (^5^^2E^[|M,„r]+E^ 



utut-j - M. 

t=j+i 



< 



1/2 I rv\ 



piCn^^ + C) 



n 



1/2 



It then follows that 

\o/4 



Sp Sp 



a/2 



< Cp/ri^^'^ and them maXpg[i p^] E 



C (Pn/n) . Hence the Markov inequality gives 



Sp - Sp) /pV2 



a/2' 



< 



P I max 

, pe[i,p„] 



Sp Sp 



p 



1/2 



p=i 



Sp Sp 



p 



1/2 



> t 



< ^ max E 

pe[i,p„] 



Sp Sp 



p 



1/2 



< 



C_ 



and Pn = o [n^l^'^'^'^+^l^))^ gives 



max 
pe[i,p„] 



Sp Sp 



p 



1/2 



Op(l). 



(C.36) 



Step 2: some Gaussian approximations. Let 7^ = 7„ (1 + e/2) / (1 + e). (3.1) gives 
7n > 7n ^ 7n = (21nlnp„)^^^ (1 + e/3). Consider a three times continuously differentiable 
function l{x) with maxj=i^2,3 sup^ L*^'^^ (x)| < 00 and I (x > 0) < l{x) < I (x > — e). Let 



X(x) = l{x — 7^). Let Sp be as in (C.16). Then Lemma C.5 with e = pll^'^°'\ (C.14) and 



45 

( C. 16 ) , and Assumption [rI give 



P f max {s,} > 7^ < P (-M > In) < E [X (M)] 
< E [X {M, ivi))] + o (1) < P (A^i (r^i) > 7; - e) + o (1) . 

We now look for a more explicit expression for the RHS. Recall that A^i (r^i) = (^Yl^=i i^pi 
Consider Q (p) = [ui, . . . ,Up]' where the cUp's are i.i.d. standard normal variables, 

/C (p) = Diag ((1 - j/n) Kjp,j = 1, . . . ,p) , 
Cr, (p) = [Gov {rij^t, Vj2t) J1J2 = 1, . . . ,p] , 
V, (p) = C]l^ (P) /C (p) C\l^ (p) , 

and (p) = Diag ((1 — j/ n) Kjp Var {rjjt) ,j = 1, . . . ,p) the p x p diagonal matrix obtained 
from the diagonal entries of Vrj (p). Then the Spi (1; 771), p = 1, . . . ,p^, have the same joint 
distribution than 

~ _ n{p)'V,{p)n{p)-a^E^{p) _ 

Sp- a^V^ip) ' P-'^---^Pn^ 

so that M.\ (r^i) and M. = ( ^^=1 (sp) j have the same distribution, and then 
P ( max Is,} > 7O < P (-M > 7; - e) + o (1) . 

Define now 

(p)' (p) (p) - a^E^ (p) _ E •=! (1 - ^ ) Var {^,^) uj] - a'E^ (p) 



Sp 



ct^Va (p) a^VA (p) 

Then uniformly in p = 1, . . . ,p^, 

n{p)'{v,{p)-v, {p))n{p) 



I I 



ct^Va (p) 



<C |Cov {r]j^t, Vj2t) I l^ii I \^j2 1 = Op (1) 

i<ii7^i2<p 
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by Lemma C.3 Hence since / (x) < 2 V x by (C.14) and using (C.15) 



M<{l + 0{^)) max {2 V s,} < (l + O ) 2 V max {5,} 



< 1 + 



111 71 

-7^ ) ) max {s„} + Op (1) . 



Define now 



p \ 1/2 

2 



Va (p) 



which is such that 



\sp — Sp\ < \tip\ + |e2p| where 
a^VA ip) 



^2p 



{ (1 - a Var (r^,,) - a^} K,,u] - a' E?=i ^^.p 



Since K' {■) is continuous on [0,1], the Weierstrass Theorem imphes it can be uniformly 



approximated with a sequence of polynomial function. Hence (C.l), Assumption K and the 
LIL for weighted sums in Li and Tomkins (1996) gives that 

|Va {p)sp\ 



lim sup 
P-5-00 (2 Inlnp) 



< ^2 y K"^ (t) dt^ , almost surely. 



Since, under Assumption K, Va (p) /p^^^ ^ (2 / K^{t)dty^^ by convergence of Riemann 
sums, this gives 

\V2 n _L ^™ ^^ w (C.37) 



sup |Sp| < (21nlnp„)'/'(l + op(l)) 

P6[2,p„] 



Observe also that Lemma 

p e [i,p„] 

Va (p) 



B.l-(ii), p„ = o(n^/^), and Assumption K give uniformly in 



1/2 



Hence 



max Cip 

P6[2,p„] 



2 In lnp„ ^ 



1/2 
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Now, for maxpg[2,p^] |e2p|, we have by Lemmas B.l 
K 



fii) and C.3 



Pn = o (n^/^) , and Assumption 



1 ^ ^3/2 



max \t2p\ < C <^ IVar (rjjt) - cu • + - jw- + 
.j=i j=i 



Op(l) + Op(^) =Op(l) 



Hence maxpgp^p^j |sp — Sp| = Op (1) and substituting in the bounds for P (maxpg[2_p^j {sp} > 7^) 
and M above gives, by (O), 7^ = 7n (1 + e/2) / (1 + e), 7^ > (21nlnp„)^/^ (1 + e/3) and 



(C.37) 



P max > 7; 



P 1 + 



In n 



max^ {Sp} + Op (1) > 7; - e + o (1) 



^l/8a / / pG[2,pJ 



<P( max {Sp} > (21nlnp„)^/^(l + e/3) ) +0(1) 



0(1). 



(C.38) 



Step 3: Conclusion. Propositions 



B.2 



and 



B.l 



Lemma 



B.l 



and = O (n^/^) , the 



expression of 5*^ and Sp in (C.16) and (C.36) gives 



max 



(Sp - ^1) /Rl - Ea (p) (Sp - ^1) - RIEa (p) 



/ N max 
Va [p] pe[2,pj rIVa {p) 

Sp - 5i) - RlE^ (p) 



(l + op(l)) max 

pe[2,p„] 



^^^A ip) 



+ 0p [l+fj' [Rl-Rl 



(l + Op(l)) max {sp} + Op(l). 

pe[2,p„] 



Hence (C.38) gives, since 7n — 7^ — )■ +00, 



Sp-SA/Rl-E^ip) 
P I max ^ ^— > 7J = P max {sp} > 7^ + o (1) = o (1) 



This ends the proof of the Proposition. 



□ 



C.7. Proof of Propositions B.4 and B.5 When studying the mean and variance of 5*^, 
we make use of Theorem 2.3.2 in Brillinger (2001) which implies in particular that, for any 
real zero- mean random variables Zi, . . . , Z4, 



Var (Z1Z2, Z3Z4) = Var(Zi, Z3) Var(Z2, Z^) + Var(Zi, Z4) Var(Z2, Z3) 

+ Cum(Zi,Z2,Z3,Z4). (C.39) 



Note that Assumption |R] and Theorem C.l imply that 



sup |r„ (0,^2, • • • ,tq)l < 00. 



"''jeP,-] t2,...,t^ = -oo 



(C.40) 



C.7.1. Proof of Proposition B.4 (C.39) yields 



E 



n-] 



1 



tl,t2=l 



^ (^i + + Rt2-H+jRt2-t,^3 + r (0, J, h - h, t2-ti+ j)) 



tl,t2 = l 



where 



tl,t2 = l 

Rt2-ti+jRt2-ti-j 

0*2 = 1 

^ r(o,j,t2-ti,t2-ti+j) 



n-jr-1 

(n-j)i?o + 2 5^ {n-j-i)R 



[n 



-j)R] + 2 J2 {n-j-i)Re+jRi 



1=1 



n-j 



tl,t2 = l 



n-j-l 

J2 {n-j-\i\)T{0,j,i,i + j) 

£=-n+j+l 



Set kj = (j/p) to prove the first equality and kj = (j/p) /t| for the second. Note that 
Assumptions K and [R| give, in both case, max^gji /cj < C and kj > CI{j <p/2). The 
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equalities above give 



E 



ra-l 



n-1 



n-l 



n-l 
n— 1 n— j— 1 



+ - 1- 



n / n 



n 



(C.41) 



j=l ^=1 ^ 

n— 1 j— 1 

j=l e=-n+j+l 
?2 v^n-l 



n 



J + 1^1 



n 



We start with the item i?Q Yl^=i ~ ^i' which is equal to -Rg-^ (p) when fc^ = ii'^ (j/p)? 



that is when proving the first equality. When kj = (j/p) (B.3) gives, under Assump- 
tions [K] and [R[ 



n-l 

«SE(i 



i=i j=i 



so that i?2 YJ'Zl (1 - j/n) fcj- >E{p)-C'. 



Let us now turn to the other items. The lower boundfcj > CI{j < p/2) gives that 



(C.41 ) is larger than Cn X]jf?i To bound the remaining terms in (C.41 ), we note that by 
Assumptions K, R|and (C.40), 

n-l n-j-1 ^ . 

3 + 



E%E 1- 

n~l n—j—1 , 

E% E 

j=i i=\ ^ 



n 



J + 



n 



Re 



n-l 



oin 



2 

'i' 



i=i 



Re+iR. 



E« 



+ 00 +00 



j=i e=i 



0=0 



n— 1 n—j—1 

E^. E (1 

j=l e=-n+j+l 



J + 



n 



<c Yl \^i^,t2,h,u)\<c 



uniformly with respect to p G [l,p„]. Substituting these bounds in the equality above 
establishes the proposition. □ 
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C.7.2. Proof of Proposition B.5. Let / be the spectral density of the alternative. Using 



(C.40), we obtain 



sup 1/ (A)| < C and < C 



(C.42) 



because sup,,[_,^] |/(A)| < (|i?o| +2Er=i l^^l) /(2^) ^7=1^1 < (^7=1 l^.l)'- We 
recall that Rj = Yl^=i UtUt+j/n and define Rj = K Rj = (1 — j/n) Rj. Set kj = K"^ (j/p) to 



prove the first equality and kj = K (j/p) /r for the second. Note that Assumptions K and 



R 



give, in both case, kj < CI {j < p). To avoid notation burdens, redefine Sp as X]j=i ^i^j- 



Define Dj = Rj - Rj. We have E [Dj] = and Sp = n kjR^^ + 2n kjRjDj + 
^E?=i^ kjD]. The inequahty (a + < 2a^ + 2b^ implies that 



^n-l 



n-1 



n-1 



Var (^Sp^ < 4 Var U ^ kjRjRj j + 2 Var n ^ kjD 



(C.43) 



i=i 



By identity (C.39) 



n— 1 \ n— 1 



Var I n ^ k^R^R^ J = ^ kj^kj^Rj^Rj^ ^ ^ Gov Mt^Wt^+jJ < 14 + i^i 



with 



n-1 



n-Ji n-j2 



jl J2 = l 

n-1 



ti=l i2=l 



^ kj^kj^Rj^Rj^ 5Z 5Z ^ (^1' + -^1' ^2 + J2; 

ilj2 = l tl = l t2 = l 



The second term on the right of (C.43) is, up to a multiplicative constant, equal to 



n-1 



n-1 



Var j n kjD 



2 1 2 

J = n 



E %/^.2Cov(D|^,D|J. 

ii j2=i 
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Applying (C.39) twice we obtain 



^ n-ji n~j2 



ti,t2=it3,t4=i Lg=i 

n-ji n-j2 



q=3 



i^ti+ji,ut,^ut,^+j2) Gov [ut^ut^+jj^, Ut^ut^+j^) 



tlii2 = l ^3)^4 = 1 



+ Gov {utj^Ut-^^jj^jUt^Ut^^j^) Gov {ut2Ut2+h^ ^t3^t3+i2)] 
H J Gum (tti^ttfj+ji, UtjUtj+j^, Mt3Mf3+j2, Uj^Uj^+jj) 



^11*2 = 1 431^4 = 1 



^ ^ {Rt2-tiRt2-ti+j2-jl + Rt2-ti-jiRt2-ti+j2 + r(tl,tl + jl,t2,t2 + j2)) 
tl = l t2 = l 

^ ji n-j2 

"7 Gum (n^jUf^+j^, UtjU^j+j^, Mt3Mt3+j2) ^t4^t4+i2) • 



^1)^2 = 1 431^4 = 1 



Since {a + h + cf < 8(0^ + 6^ + c^), we can write Var [n YJ^Zl kjD^j < 6V2 + K2 + GK'^ with 



n-l 



^ , /n-ji n-j2 

- J2 ^h^h Y^Y^Rt2-t^R. 

ii,i2=i \ \ti=i t2=i 

^ n-l n-ji n-j2 

- kj^kj,, Y Cum {ut 

1 '^ti+ji, ^t2^t2+jl5 '^t3'^t3+i25 '^t4'^t4+j2 

Jlj2 = l tl,t2 = l t3,t4 = l 

n-l /n-jin-j2 



''t2-tl+j2-jl 



K. 



+ ( ^ Rt2-ti-jiRt2- 

tl=l t2=l 



tl+j2 



il J2 = l 



^1=1 t2 = l 



Substituting in (G.43) shows that the proposition holds if the following inequalities hold: 



1^1 < Cn V R^ V2 < Cp, Ki <C, K'< C, K2 < C—. 
^-^ ■' n 

i=i 

We establish these inequalities in five steps. 
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Step 1: hound for Vi. We note that \Rj\ < \Rj\ and that under Assumption 



K 



< 



kj < C for all j. Using a covariance spectral representation Rj = J^^exp{±ijX)f{X)dX, the 



Cauchy-Schwarz inequality and (C.42), we obtain by Assumption K 

n— 1 n—ji n—j2 

h. h. . . R. . R 

31 




iiJ2=l ii=l i2=l 

2 

2 



/(Ai)/(A2)dAirfA2 

^^^1%%% 5^ $^e'*i^ie*(*i+^i)^=e-'*^^^e-'(*2+^'^)^^rfAidA2 
J J n.,j2=i ii=it2=i 



n-l 



n-1 



n-l 



^ ] ^ii%2-^il-^j2 ^ ] ^ ] ^t2-ti-jiRt2-ti+j2 
ilj2 = l <1=1 i2 = l 




n-l 



ra— ji n— 1 



n-j2 



X 



E ^i^^^-^ E e^*''^e^^*^"'''^V(Ai)/(A2)f/AirfA, 



ii=i 



ti=i 



i2=i 



t2 = l 



< 




— TT ^ —7T 



n— 1 n— j 

5^A;,i?,^e^*^ie^(*+^)^^ 
j=i t=i 



/(Ai)/(A2)rfAirfA2 < 



This establishes the bound for Vi. 



Step 2: bound for V2. We define ^2 = ^1 + ^2, j2 = ji + 32- By Assumption K and by 



(C.40) 



^ n-l /n-jin-j2 

—2 ^n^n YYRt2-t,R. 



t2~t\-jl+j2 



il J2 = l 

C 



.ii=l t2=l 



Jl=l 



00 / +00 

n 

j2'=-CO \ t2l=~00 



t2/+i2' I 



<Cpx[ Yl \Rt,Rt,+nRt2Rt2+n\ ]<Cp\ Y 1^*1 ) ^ 

\j2,tl,t2=-O0 



\t=—oo 
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^ n-1 /n-jxn-n 



t2-tl-jl-^t2-ti+j2 



j 1,32 = 1 

C 



Ul=l t2 = l 



+00 



t2l-jiRt2l+jl 



ii=i 



j2/=— 00 \ t2'=—00 



— ^P |-^ti-ji-^ti+ii+i2-^t2-ii-^t2+ii+i2 1 — l-Rti-Rti+i-Ri2-Rt2+jl 

i2)*l>*2=-00 j,ti,t2=-00 
\t=-oo / 

therefore V2 < Cp. 



S'tei) 3: bound for Ki. Define ^2 = ^1 + t- Assumption K, and (C.40) yield 

p 00 00 

jij2=li=-oo tl,t2,t3=-oo 



Step 4- bound for K'2. (C.40) gives 



^2<;^ E %i%2 X]X]|r(0,Jl,t2-tl,t2-tl+j2)| 



ii.i2=i 



hoo / CXD 



vtl = l t2 = l 



E E |r(o,ji,t,t + j2)l 

01,32=1 \t=-oo 
+00 00 

= ^ E E ir(o,ji,ti,ti+j2)r(o,ji,t2,t2+j2)l 

ii,i2=i ti,t2=-oo 

<c(' |r(o,t2,t3,t4)|] <c. 

\*2,t3,i4 = -00 / 

Step 5: bound for K2. Bounding K2 requires additional notation. First set t^ = ti+ji, 
^6 = ^2 + ji, tj = t^ + j2 and tg = ti + j2, and note that t^, . . . ,ts depend upon ti, . . . , ^4 
and ji, j2 only. For a partition i? = {Bi, i = 1, . . . , ds} of {!,..., 8}, define ds = Card 5, 
Tsiti, . . . ,ts) = 1 Cum (mj,^, g G -Bf) , and recall that Cum('Uf) = Eut = 0. Then the 
largest ds yielding a non- vanishing F^ is (is = 4. When rf^ = 4, i? is a pairwise partition 
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of {1, ... ,8} so that is a product of covariances. Let B be the set of indecomposable 
partitions of the two-way table 

1 5 

2 6 

3 7 , 

4 8 



see Brillinger (2001, p. 20) for a definition. Then according to Brillinger (2001, Theorem 
2.3.2), 



Cum (ut^ut,+j,,ut^ut2+j^,ut:,u^+j^,ut^ut^+j^) 



J2 rB{h,...,ts). 

BeB,dB=4: 



BeB BeB,dB<3 

Some properties of partitions in B are as follows. Call {1,5}, {2,6}, {3,7} and {4,8} 
fundamental pairs and say that a i?i in a partition B breaks the pair {1,5} if {1,5} is not a 
subset of Bi. Then partitions B & B are such that each B^ E B must break a fundamental 
pair. Note that fundamental pairs play a symmetric role. Since — tq is ji or j2 with 
vanishing kj^ or kj^ if ji or j2 is larger than p, the indexes tq and of a fundamental 
pair also play a symmetric role in the computations below. We now discuss the contribution 
to K2 of partitions of {1, ... ,8} according to the possible values 1, ... ,4 of ds- Due to 
symmetry, we only consider representative partitions for each case. 



Under Assumption K and (C.40), the case ds = ^ gives a contribution to K2 bounded 



by 



^ n-l n-jr'i n-j2 



ii,i2=i 



c 



J2 \T{0,t2-tu...,ts-h] 



ti,...,ts=-n 

00 



<- E |r(o,4...,t'3)|<-. 
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The case ds = 2 corresponds to {Card -Bi, Card -82} being {2,6}, {3,5} or {4,4}. These 
cases are very similar and we hmit ourselves to {2,6} and Bi = {1,2}. The corresponding 
contribution to K2 is bounded by 



^ n— 1 n—ji n— J2 



h,n=i 



c 



<— |r(o,t2-ti)r(t3-ti,...,t8-ti 



n 

tl,...,tg=— n 
n n 



<^ Y |r(o,t'2)r(t'3,...,t'8)l<^El^*l Y |r(o,t;-4...,t's-t'3: 



n 



t=-n t'^,...,t'g=-n 



ta)\<C, 



by Assumption K and (C.40). 

The case = 3 corresponds to {Cardi?i, Card B2, Card -83} being {2, 2, 4} or {2, 3, 3}. 
We start with Card-Bi = 2, Cardi?2 = 2 and Cardi?3 = 4. The discussion concerns the 
number of fundamental pair broken by B^. Note that the situation where -B3 breaks only 
3 or 1 fundamental pair is impossible. The case where B^ does not break any fundamental 
pairs corresponds to partitions that are not indecomposable, so that the only possible cases 
are those where B^ breaks 4 or 2 fundamental pairs. 

• B3 breaks 4 fundamental pairs. Consider B^ = {1,2,3,4}, B2 = {5,6} and B3 = 
{7,8}. The corresponding contribution to K2 is bounded by 



n-l 



^ ,0 ^ n-ji n-j2 

- Y ^h^n Y Y ^B{ti,...M 

il,i2 = l tl,t2 = l i3,t4 = l 

^ n-l n-ji n~j2 

- Y ^31^32 Y Y ^ i^^h-tuts-tuh-ti) Rt^_t,R 



t-i—td, 



<c-sup|i?jf Y |r(o,t2,t3,t4)| < c 



jlj2 = l 
2 



tl,t2 = l t3,t4 = l 
00 



t2i<3i*4 = — 00 



Pi 

n 



by Assumption K and (C.40). 

i?3 breaks 2 fundamental pairs. Take B^ = {1, 2, 3, 5}, B2 = {4, 6} and Bi = {7, 8}. 
The change of variables ^2 = ^2 + ^i, ^3 = ^3 + ti and = t'^ + t3 shows that 
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contribution to K2 is bounded by 



n-l 



J2 = l ti,t2 = lts,t4 = l 

^ n—1 n—ji n—j2 

- kj^kj^ Y Y ^{0,t2-ti,ts-ti,ji)R 



c 



jl J2 = l 
n-l 



00 



<-Yk\j2/p) Y |r(o,t2,i3,Ji)l Y \Rt',\xsnp\R,\<c 



J2 = l 



t4=-0O 



under Assumption K and (C.40). 

We now turn to the case Card-Bs = Cardi?2 = 3 and Cardial = 2. Observe that -B3 
or B2 must break 3 or 1 fundamental pair. The discussion now concerns the fundamental 
pairs which are simultaneously broken by B3 and B2. Note that and B2 cannot break the 
same 3 fundamental pairs because if it did, Bi would be given by the remaining fundamental 
pair in which case Bi cannot communicate with B2 or i?3, a fact that would contradict the 
requirement that the partition {Bi, B2, B^} is indecomposable. 

• i?3 and B2 break 3 fundamental pairs, 2 of which are the same. Take B3 = {1,2, 3}, 
B2 = {4, 5, 6} and Bi = {7, 8}. Using change of variables ^2 = ^1+ ^3 = ^1 + ^3 



and ^4 = ^3 + ^4, we can see that under Assumption K and (C.40) the contribution 
to K2 of this case is bounded by 

n-l n-ji n-j2 



Y Y Y ^B{h,...,t8) 

j'2 = l ti,t2 = l t3,t4 = l 

^ n-l n-ji n-j2 

^ Y ^n^n Y Y r(0,t2-ti,t3-ti)r(0,ti-t4+Ji,t2-t4+Ji)i?: 



< 



c 



ii .i2=i 

n-l 



ii 1*2=1 43,(4=1 



n 



Y K\jMK\j2/p)snp\mt2,h)\ \^{0,t'2A)\ Y \^^'.\^^ 



t2,t3 



ti—i-j 

n 



Note that the case where -B3 and B2 break 3 fundamental pairs with less than one 
in common is impossible. 
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The next case assumes that B2 breaks only 1 fundamental pair, which is also necessarily 
broken by since B2 must contain the remaining unbroken pair. 

• i?3 breaks 3 fundamental pairs and B2 breaks only 1 pair. Take B3 = {1,2,3}, 
B2 = {4,5,8} and B^ = {6,7} and consider a change of variables t2 = ti + t'2, 



ts = ti + and = ti + ji — t'^. Under Assumption K and (C.40), the contribution 
of this term to K2 is bounded by 

n-1 n-ji n-]2 



J2=l tl,t2 = l t3,t4 = l 

^ n-1 n-ji n-j2 



il,i2 = l tl,t2 = l <3,i4 = l 

n— 1 c 



ilj2=-oo 



f2,t3 = — 00 



i?3 and i?2 break only 1 pair. Note that i?3 and B2 cannot break the same pair 
because B\ must be the remaining pair and cannot communicate, so that the par- 
tition is not indecomposable. Hence all the partitions in this case are similar to 
53 = {1,2,5}, B2 = {3,4,8}, Bi = {6,7}. The change of variable ^2 = ^1+ t'^, 
^3 = —j2 + ^2 + ji + ^3 and t4 = ^3 — ^4 yields a contribution to K2 bounded by 

n— 1 ii n~j2 



E E E ^B{ti,...,t8) 

,j2 = l <l,t2 = l t3,*4 = l 

^ n— 1 fi— jl ri—j2 

^ E ^^i^j-^ E E r(o,t2-ti,ji)r(t3-t4,o,j2)i?. 



t3-i2+J2-Jl 



jl J2 = l 



^11*2=1 t3it4 = l 



<c E ir(o,t2,ji)l E ir(i4,o,j2)| E 1^*3! ^c'.D 

jl,t'2=-00 j2,t'^ = -00 



fg= — 00 
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